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PREFACE 


Graph theory provides simple, accessible, and powerful tools 
for constructing models and solving problems having to do with 
discrete arrangements of objects. Technology today poses a 
great number of problems that require the construction of complex 
systems through specific arrangements of their components. 
These problems range from the scheduling of industrial processes 
to PERT and critical-path analyses, to tactics and logistics, to 
communication systems, to the study of information transmittal, 
to the choice of optimum routes and network flows, to electrical 
networks, to organic-chemical identification, to switching systems, 
to economics, to social-group structures, to games and puzzles, ete. 
The applications are numerous. Combinatorial construction 
techniques for finding appropriate arrangements are considerably 
different from the arithmetic computational approach to systems 
through the equations of classical analysis. In addition to the use 
of graphs, the subject of arrangements can be approached through 
matrices whose entries are zeros and ones, or simply through the 
analysis of specific finite families of sets. 

It can be truly said that graph theory is one of the simplest, 
most elegant subjects of modern mathematics which possesses a 
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wide variety of applications. Based on the simple idea of points 
interconnected by lines, graph theory combines these basic in- 
gredients into a rich assortment of forms and endows these forms 
with flexible properties, thus making the subject a useful tool for 
studying many kinds of systems. In addition, graph theory has 
contributed to the analysis of a wide variety of combinatorial 
problems. 

The term network is frequently used instead of graph, espe- 
cially when quantitative characteristics are imparted to the points 
and lines, in addition to the purely structural relationships that are 
the defining characteristics of a graph. One speaks, for example, 
of electrical networks, project networks, and flow networks, in 
which quantitative measures of energy, effort, and flow, respee- 
tively, are associated with the edges. 

This book is specially written for the teacher desiring to fulfill 
the need of the undergraduate and first-year graduate student in 
mathematics, physical and social sciences, engineering, economies, 
and operations research to get a relatively broad idea of graph 
theory and to receive such varied exposure to its application that 
his creative talent may thereby be inspired. Indeed this has been 
our experience with students who learned their graph theory from 
this book and went on to apply it in their work and to conduct 
research on the many interesting problems in the field. The 
manuscript is an outgrowth of a semester course taught at the 
George Washington University, at the Graduate School of the 
Department of Agriculture, and at various other institutions over- 
seas over a period of several years. 

In view of the basic nature and broad applicability of graph 
theory, it is clearly desirable for anyone engaged in or preparing 
for work in the general area of science or engineering to become 
familiar with at least the fundamentals of this topic. It is the 
objective of this book to provide such familiarity, in a form suit- 
able as the basis for an introductory course in graph theory or for 
self-study. Despite the abundance of journal articles dealing with 
specialized topics, the available expository literature does not meet 
all needs. 

In attempting to meet this objective, we have chosen for the 
most part to place the material in the setting of a relatively in- 
formal discussion of central ideas, amplified and illustrated by a 
variety of applications. By so doing, it is hoped that the interested 
reader will be left (a) in a position to relate and adapt these basic 
concepts to his own field of application, and (6) in a frame of mind 
to pursue the matter further, convinced of its importance. 
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The book is organized into two parts. Part I, consisting of 
Chapters 1 through 5, presents basic theoretical material. Chap- 
ter 1 contains the fundamental definitions, terms, and symbols 
required to describe and classify undirected graphs. It is at the 
end of this chapter that we have included a large number of refer- 
ences of well-known books on and related to the subject. The 
bibliography of Moon and Moser which appears at the end of 
Ref. 8a is thorough and indicative of the fantastic range of the 
subject. Chapter 2 serves a similar function for oriented or di- 
rected graphs. In Chapter 3 the basic development of concepts 
is continued, centering around various ways of partitioning the 
elements of a graph and the measurement of distances in graphs. 
Chapter 4 discusses properties and characterizations of the im- 
portant class of planar graphs, as well as the class of problems that 
have come to be known as coloring problems. In Chapter 5, 
which concludes Part I, the emphasis shifts from geometric to 
algebraic considerations, with a discussion of the role of matrices 
in the characterization and structural analysis of graphs. 

Part II, consisting of Chapters 6 and 7, illustrates the appli- 
cation of the foregoing theory in a variety of contexts. Chapter 
6 in particular (long and rich in variety) contains brief discussions 
of the utility of graph theory in numerous connections. Chapter 
7 considers linear flows through constrained networks from a graph- 
theoretic point of view. General procedures for maximizing flow 
and minimizing cost are developed. A brief introduction to sto- 
chastie network flows is given, relating the ideas to queueing theory. 

The only prerequisites are a familiarity with basic terminology 
of set theory and a knowledge of elementary concepts related to 
matrices and vector spaces. 

An interesting philosophical observation that one might make 
is based on Theorem 1-2 given in Chapter 1 of the possibility of 
representing any graph with a continuum number of vertices and 
edges in three-dimensional space without intersections. While 
one might gain in simplicity in representing flows by going to 
higher dimensions, the analytical concepts of flow would not 
necessarily be enriched by doing so. The significance of this ob- 
servation becomes apparent when one considers that man can 
largely be characterized in terms of various flows in his nervous 
system (with its infinite possibilities and multidimensional ab- 
stractions) which he uses to organize and relate himself to his 
environment. It would seem that three dimensions is also ade- 
quate and economical for the realization of the “flow-system man.” 

We are grateful to Mr. Jack Edmonds for many helpful sug- 
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gestions and useful additions, particularly to Chapter 6, and to 
Dr. Horace Trent for very helpful ideas and profitable discussions 
on varied topics. For additional suggestions, particularly in 
Chapter 6, and some help in partial collecting and writing, our 
thanks also go to Dr. David Rosenblatt (Section 6-2), Mr. 
Charles MacLean (6-10), Dr. Kurt Fey (6-4), Mr. John Boushka, 
Miss Susan Meador (6-22), Mr. William D. Murray (6-28), Mr. 
P. C. Ryan, and Mr. Edward Stern. 


ROBERT G. BUSACKER 
THOMAS L. SAATY 
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PARTI 


BASIC THEORY 


This part of the book contains five chapters. Chapters 1 and 2 
contain fundamental definitions and theorems concerning undirected 
and directed graphs, respectively. In Chap. 3 this basic develop- 
ment is continued, revolving around various methods of partition- 
ing and measuring distances in graphs. Chapter 4 is concerned 
with planar graphs and coloring problems of which the classical 
four-color problem is the outstanding example. In Chap. 5 the 
emphasis shifts from geometric to algebraic considerations, and 
certain properties of graphs are investigated through their repre- 
sentative matrices. 
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BASIC CONCEPTS: 
UNDIRECTED GRAPHS 


1-1 INTRODUCTION 


In this chapter the concept of a graph is introduced, first as a geo- 
metric structure consisting of points (vertices) in space intercon- 
nected by a system of curves (edges) and then in abstract terms. 
The basic terminology and symbols needed to discuss fundamen- 
tal local and global structural properties of graphs are then 
introduced and discussed. Thus this chapter—together with the 
next, dealing with graphs whose connecting edges are oriented— 
establishes the basic vocabulary for describing graphs. In order 
to occasionally relieve the monotony of these necessary prelim- 
inaries, a number of basic results which follow rather directly 
from the definitions are included. 

Because the terminology and symbolism currently in use in 
graph theory is not standardized, the reader is urged to study the 
conventions of these two chapters carefully before proceeding. 
Because they are one-dimensional configurations, graphs are 
often referred to as linear. 
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1-2 GEOMETRIC GRAPHS 


Before introducing the concept of a graph in its most general form, we 
consider a class of graphs known as geometric graphs. By doing so, we 
shall have from the outset a convenient visualization of the various con- 
cepts and structures to be considered. It will be shown presently that 
every graph of interest is abstractly identical (with respect to those 
properties of interest in graph theory) with an appropriate geometric 
graph. Thus geometric graphs may be regarded as convenient representa- 
tions of all graphs, and not merely as specialized examples. 

Let §" denote n-dimensional euclidean space. (We shall be primarily 
interested in 2- and 3-dimensional spaces as a result of Theorems 1-1 and 
1-2 discussed later.) Euclidean n space is the space of all sequences of 
n real numbers x = (21, .. . jn), With distance between two points 


a= (m1, .. « sn) ¥ = (Yay - » + Yn) given by d(w,y) = [ )) @ — yd? ]. 
i=1 


A simple open curve in &" is a continuous, non-self-intersecting curve 
joining two distinct points in 6" (i.e., a curve obtainable by continuous 
deformation of a straight-line segment), Similarly, a simple closed curve 
is essentially a continuous, non-self-intersecting curve whose end points 
coincide, 

A geometric graph in &" is a set V = {v;} of points in &" and a set 
E = {e;} of simple curves satisfying the following conditions: 


1. Every closed curve in # contains precisely one point of V. 

2. Every open curve in E contains precisely two points of V, and 
these agree with its end points. 

3. The curves in Z have no common points, except for points of V. 


Thus a geometric graph is simply a geometric configuration or struc- 
ture in &" consisting of a set of points interconnected by a set of non- 
intersecting continuous curves. 

Many familiar structures may (when idealized) be regarded as geo- 
metric graphs and studied by applying the methods developed in sub- 
sequent pages. A system of highways, for example, can be idealized as a 
graph by ignoring the width of roads and considering junctions to be 
points. Many additional examples of graphlike structures in the real 
world will be introduced in subsequent pages. 

Figure 1-1 depicts a geometric graph and illustrates the manner in 
which graphs will usually be represented. In the context of graph 
theory, the elements of V and Z are respectively called geometric vertices 
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Figure 1-1 


and geometric edges. We could, at this time, introduce the various des- 
eriptive terms which make up the basic vocabulary of graph theory. 
(For instance, in Fig. 1-1, edges es and e; are “‘parallel,”’ vertex vs is ‘“‘iso- 
lated,” vs and v5 are “adjacent,” etc.) Before doing so, we redefine graphs 
in general terms, so that these terms need be introduced only once and 
in a general setting. 


1-3 ABSTRACT GRAPHS 


While many of the graphs arising in applications are (when idealized) 
geometric graphs, the only essential structural feature in the context of 
graph theory is the fact that each geometric edge has associated with it 
two (possibly coincident) geometric vertices. Graph theory focuses on 
these associations and disregards the nature of vertices and edges. Thus 
the enumeration of edges and vertices given in the following table con- 
tains all of the essential information concerning the geometric graph of 
Fig. 1-1. 


Edges Corresponding vertices 
e Vay U6 
eo Vay U3 
63 Vay Vs 
e& v1 
es iy V4 
es Va V6 


To facilitate the general definition of a graph, we first introduce the 
concept of the unordered product of a set with itself. Recall that the 
ordered or cartesian product of a set S with itself, denoted by S x 3S, is 
defined to be the set of all ordered pairs (s,), where se SandteS. Here 
(s,t) and (t,s) are considered to be distinct entities except when s = t. 
In a similar vein, the symbol (s & ¢) will denote an unordered pair of 
elements of S, and the set of all distinct unordered pairs will be denoted 
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by S & S and called the unordered product of S with itself. Here (s & t) 
and (i & s) are equivalent symbols for the same entity and, as in the case 
of (s,t), it is permissible that s = ¢. Note that if S has k elements, SxS 
consists of k? ordered pairs while S & S consists of k(k + 1)/2 distinct 
unordered pairs. 

An abstract graph, or simply a graph, may now be defined as follows: 
A graph consists of a nonempty set V, a (possibly empty) set E disjoint 
from V, and a mapping # of H into V & V. The elements of V and Z are 
called the vertices and edges of the graph, respectively, and # is called the 
incidence mapping associated with the graph. 

If e¢ E and v and w are the vertices such that #(e) = (v & w), then 
edge ¢ is said to be incident with each of vertices v and w, and vice versa. 
All other vertices are regarded as being nonincident with e. The vertices 
incident with an edge are called its end points, and are also said to be 
joined by the edge. 

While the incidence relationship is fundamental to the concept of a 
graph, in many contexts there is no need to refer to ® explicitly. In such 
cases, the fact that v and w are the end points of eis denoted by e ~ (v & w), 
read as “‘e joins v and w.” 

A graph will usually be denoted by G@ or (V,E,®), or by (V,E) when 
the incidence mapping remains implicit. Note that /, but not V, may 
be empty. While graphs with no edges are uninteresting in themselves, 
their inclusion is occasionally useful, as, for example, in connection with 
constructive (or, more accurately, destructive) procedures involving the 
successive deletion of edges from a given graph. A graph is said to be 
degenerate if and only if it has no edges. 

If V and BZ are both finite sets (the empty set being included as a 
finite set), G is called a finite graph. Otherwise it is said to be infinite. 

The introduction of the concept of abstract graphs serves a purpose 
aside from stripping the incidental geometric features away from the 
essential combinatorial characteristics of a graph. It enlarges the pros- 
pects for applications, since many real-world entities have the combina- 
torial characteristics necessary to be viewed profitably as graphs. For 
example, the relationships between the component individual activities 
which make up a complex project can be depicted as a graph. Here the 
edges (when assigned orientations or directions as in Chap. 2) represent 
the individual activities, and the incidences within the graph reflect the 
way in which the execution of certain tasks is contingent on the prior 
completion of other tasks. We shall return to this important class of 
project graphs in Sec. 6-4. Note that in this case a geometric visualiza- 
tion of the situation is simply a (convenient) artifice and not an idealiza~ 
tion of actual physical connections between objects in space. 
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1-4 ISOMORPHISMS AND REALIZATIONS 


A geometric graph G is clearly a particular instance of a graph, in which 
the vertices and edges are respectively points and simple curves in 8" and 
in which 6(e) = (v & w) is understood to mean that v and w are the end 
points of curve e or that v is the sole vertex contained in closed curve e 
ify = w. 

It was remarked earlier that every graph is abstractly identical 
with—or, to use a more customary term, isomorphic to—an appropriate 
geometric graph. Isomorphism between graphs is formally defined as 
follows: Graphs G = (V,£) and G’ = (V’,E’) are said to be isomorphic to 
each other if there exist 1-1 correspondences between V and V’ and 
between # and H’ which preserve incidences. In other words, edge ¢ is 
incident with vertex v in G if and only if the corresponding elements e’ and 
v’ are incident in G’. If @ is isomorphic to a geometric graph G’, then G’ 
is called a geometric realization of G. (In particular, a geometric graph 
may be considered as a geometric realization of itself.) 

A graph is said to be planar if and only if it has a geometric realiza- 
tion in &*. For example, the graph G shown in Fig. 1-2a is planar, since 
it is isomorphic to the graph G’ shown in (b). This figure illustrates an 
evident but important fact: A geometric graph may be planar even though 
it cannot be deformed by a continuous transformation into a planar 
graph. While G and G’ have important distinctive features from the 
standpoint of topology, they are equivalent from the standpoint of graph 
theory. 

Figure 1-3 depicts a nonplanar graph: one having no geometric reali- 
zation in &*. This graph is, in fact, one of two fundamental configura- 
tions which characterize all nonplanar finite graphs. This fact, embodied 
in an important theorem due to Kuratowski, is established in Chap. 4, 
where the distinctive features of planar graphs are investigated. In con- 
% 


¢,/ 
e 1 
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Figure 1-2 
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trast to the fact that only a restricted class of 
finite graphs have realizations in &?, we have 
the following: 

Theorem 1-1 Every finite graph 
G = (V,E) has a geometric realization in &*. 

Proof: Let L denote an arbitrarily chosen 
line in &*. Corresponding to each vertex v« V 
select a distinct point v’ on L, and correspond- 
ing to each unordered pair (vp &w)e V&V 
select a distinct half plane H,«. having L as 
its boundary. For each edge e in @ such that 
e~ (v&w) construct in He. a simple curve 
é’ joining v’ and w’ and having no other points in common with L or 
with other curves in Hyg». (This is clearly possible.) The resulting 
structure is a geometric realization of G. 

While the validity of Theorem 1-1 is self-evident, we have indicated 
this construction primarily because it can be extended in a straightfor- 
ward manner to establish the following less obvious theorem, which com- 
pletely characterizes realizable graphs: 

Theorem 1-2 A graph G = (V,E) has a geometric realization in 
8° if and only if the elements of V and also those of H can be placed in 1-1 
correspondence with a subset of the set of all real numbers. 

(For the reader familiar with the concept of cardinal number, @ is 
realizable in &? if and only if neither V nor Z has cardinality exceeding 
that of the continuum.) 

In this book we shall be almost exclusively concerned with finite 
graphs, and never with graphs which do not meet the conditions of 
Theorem 1-2. Moreover, the structural properties to be studied are all 
preserved by isomorphisms, since they all rest, ultimately, on the concept 
of incidence. It follows that we can, without any real loss of generality, 
think entirely in terms of geometric graphs in &*. In particular, all illus- 
trations of graphs will be geometric graphs. 

Sketch of proof of Theorem 1-2: The points in &" can be placed in 1-1 
correspondence with the real numbers. 

Clearly, a graph G = (V,Z) is not realizable in & (or, in fact, in & 
for any positive integer n) if either the vertices or the edges cannot be 
placed in 1-1 correspondence with a subset of the points in 6. On the 
other hand, if such correspondences exist, it is possible (referring to the 
terminology in the proof of Theorem 1-1) to select a distinct point v’ on L 
for each vertex and a distinct half plane H,sw for each unordered pair of 
vertices. This having been done, the curves in Hygw can be constructed 
according to the scheme shown in Fig. 1-4. Hach point on the segment 


Figure 1-3 
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M determines a simple curve (a broken line segment or circle) disjoint 
from other such curves except for v and w’. 


1-5 TERMS DESCRIBING LOCAL STRUCTURE 


In order to describe various structural features of graphs efficiently, the 
following terms are useful. They are all particularly transparent in the 
context of geometric graphs. 

If e~ (v& w), v and w are called the end points of e, whether the 
graph under consideration is geometric or not. If » = w, then v is the 
sole end point of e and e is called a loop. If e.~ (v& w) and eg ~ 
(v & w), then e; and és are called parallel edges. In particular, two loops 
incident with the same vertex are parallel. Vertices v and w are said to 
be adjacent vertices if e ~ (v & w) for at least one edge e. In particular, 
v is adjacent with itself if there is a loop incident with v, but not otherwise. 
Similarly, edges e; and es are adjacent edges if they have at least one com» 
mon end point. Note that adjacency is a relationship between two like 
elements (either vertices or edges), while incidence is a relationship 
between unlike elements. 

The number of edges incident with v, with loops counted twice, is 
called the degree of v and denoted by 6(v). A vertex v is said to be 
isolated if 6(v) = 0. Thus, in particular, a degenerate graph is one for 
which every vertex is isolated. 

If S is any finite set, |S| denotes the number of elements in S. In 
particular, |V| and |£| denote the number of vertices and edges, respec- 
tively, of the finite graph G = (V,Z), Since every edge contributes 1 to 
the degree of two vertices (or, in the case of a loop, 2 to the degree of a 
single vertex), we have 


x, 5(v) = 2|B| 
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Figure 1-5 


If Vo and V; are the sets of vertices having even and odd degree, 
respectively, then clearly »» 4(v) is even, since it is a finite sum of even 
ve 


numbers. It follows that 


»» 6(v) — y 6(v) = »» 6(v) 


veVo 


is also necessarily even, which establishes the following: 

Theorem 1-3 In a finite graph, the number of vertices of odd 
degree is even. 

In order to fix the terminology of this section firmly in the reader's 
mind, these terms are now illustrated in the context of the graph of Fig. 
1-5. The end points of e: are vz and v2, and the loop es has the single end 
point 7. Edge éy is adjacent with e; and parallel with es. (Note that 
parallel edges are also adjacent.) Vertex v; is adjacent with vs and with 
itself, but v4 is not adjacent with itself. Vertex vs is an isolated vertex. 
Four vertices (namely, v4, v2, vs, and v4) have odd degree. 

Given a graph @ = (V,Z,®), the system G; = (V1,2,,%) is called a 
subgraph of G if and only if the following conditions are met: 


1. ViC Vand £, C £. 
2. &,(e) = B(e) for every ee Fi. 
3. If ee Ey and ®(e) = (v & w), then ve Vi and we Vi. 


Expressed differently, a subgraph of G consists of selected edges and 
vertices of G, with the same incidences as those of G, and with the reason- 
able requirement that the selected vertices must include all of the end 
points of the selected edges. The graph Gis also called a supergraph of G1. 


Exercise 1-1 Interpret the various terms connected with the local structure of a 
graph in the context of a graph whose vertices and edges represent the junction points 
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and individual segments of a system of city streets. (The reader may also wish to 
anticipate certain global characteristics to be introduced shortly. For example, 
what would be a reasonable definition for the property of connectedness in graphs?) 


1-6 EDGE PROGRESSIONS, CHAINS, AND CIRCUITS 


In a geometric graph one can visualize starting at a given vertex, travers- 
ing a sequence of geometric edges in a continuous manner, and ultimately 
arriving at another specified vertex or returning to the original vertex. 
Sequences of edges which form continuous routes play a fundamental role 
in graph theory. In particular, the familiar structures which we shall call 
chains and circuits, in which no edge is traced more than once, enter in 
some way into nearly all aspects of graph theory. The above intuitive 
procedure suggests the following formal definition: 

A finite sequence @, é:, . . . , én of (not necessarily distinct) edges 
of a graph is said to constitute an edge progression of length n if there 
exists an appropriate sequence of n + 1 (not necessarily distinct) vertices 


Uo, V1, - « « » U Such that 
6 ~ (vi-1 & 4) fort =.1, 2)... ji 


The edge progression is said to be closed if vo = vn and open if vo ¥ Un. 
In the latter case the progression is said to extend from vo to vn. Note 
that an individual edge may be considered as an edge progression of 
length 1. 

(Edge progressions are frequently referred to as edge sequences in the 
literature. However, since most sequences of edges fail to be edge 
sequences in this special sense, we have avoided this terminology.) 

Referring to Fig. 1-6, the sequence e7, ¢1, és, €3, 4, és constitutes an 
open edge progression from v2 to vs of length 6, the corresponding sequence 
of vertices being v2, vs, Us, U6, V1, Us, Vs If we replace es by e; as the last 


Figure 1-6 
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term of the progression, we have an example of a closed edge progression 
of length 6. 

If all elements of the progression represent distinct edges, the pro- 
gression is called a chain progression if it is open and a circuit progression 
if it is closed. The set of edges itself, without regard to sequencing, is 
said to constitute a chain in the former case and a circuit in the latter. 
For example, the set of edges {¢s,¢s,¢7,es} in Fig. 1-6 constitutes a chain, 
since the sequence ey, és, és, €7 is a chain progression having vs, v1, Ue, Us, U2 
as the corresponding sequence of vertices. This same chain is also asso- 
ciated with the progression és, és, €4, €7, Which has v5, v6, 01, Us, V2 as the 
associated sequence of vertices. 

In some contexts it is important to distinguish between alternative 
ways of ordering the edges of a chain or circuit to form a progression; in 
others the ordering is immaterial so long as at least one such ordering 
exists. Both situations occur often enough to warrant the double 
terminology (e.g., chain progression vs. chain) introduced here. 

Tf all the n + 1 vertices vo, v1, . . . , ¥n are distinct (in which case 
the edges are necessarily also distinct), the edge progression is called a 
simple chain progression and the set of edges—without regard to order—is 
called a simple chain. If vo = v» but the vertices are otherwise distinct, 
the progression is called a simple circuit progression and the unordered set 
of edges is said to constitute a simple circuit. Note that in a geometric 
graph simple chains determine simple open curves, as defined earlier, 
while simple circuits determine simple closed curves. 

It is instructive to interpret the concepts of progression, chain, and 
circuit in various specific application contexts. Consider, for example, a 
graph whose vertices represent the individuals in an organization and 
whose edges correspond to pairs of individuals between whom direct com- 
munication is possible. Describe the movement of information within 
such a structure in terms of progressions, chains, and circuits. 


Exercise 1-2 Prove that every open edge progression from » to w includes, as a 
subsequence, a simple chain progression from » to w. In particular, every nonsimple 
chain progression contains such a simple chain progression. 


Exercise 1-3 Show, with an example, that the following is not true in general: 
If P is an edge progression from 1: to v2 which passes through v3, where v3 # v1 OF vs, 
then P includes, as a subsequence, a simple chain progression from 1 to v2 which also 
passes through »,. 


Exercise 1-4 Why is the following characterization inadequate? An edge pro- 
gression is a sequence of edges such that adjacent terms of the sequence correspond to 
adjacent edges of the graph under consideration. 


Exercise 1-5 Prove that if v1, vs, and vs are distinct vertices such that v, and ve 
are joined by a chain and v2 and v, are joined by a chain, then v: and v; are joined by a 
chain. 
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Exercise 1-6 Under the assumptions of Exercise 1-5, give an example to show 
that there does not necessarily exist a simple chain joining »; and v; which passes 
through (i.e., is incident with) vs. 


Exercise 1-7 Prove that if the edges of a chain can be ordered in more than two 
ways to form a chain progression, then the chain is not a simple chain. 


Exercise 1-8 The converse of the statement in Exercise 1-7 is not true in general. 
That is, there exist chains which are not simple but which cannot be ordered in more 
than two ways to form a chain progression. Provide an example, and try to char- 
acterize all such chains. 


Exercise 1-9 Prove that every nonsimple circuit can be partitioned into two or 
more simple circuits. 


Exercise 1-10 Prove that every nonsimple chain joining v and w can be parti- 
tioned into a simple chain joining v and w together with one or more simple circuits. 
Thus only simple chains are minimal in the sense that they contain no proper subset of 
edges connecting their end points. 


Exercise 1-11 If “nonsimple chain” is replaced by “open edge progression” in 
Exercise 1-10, or if ‘“‘nonsimple circuit’’ is replaced by “closed edge progression” in 
Exercise 1-9, the resulting statement is not true in general. Provide a counter- 
example for each. 


Exercise 1-12 Show that any finite set of nonnegative integers can be realized 
as the degrees of the vertices of an appropriate graph, provided that an even number of 
them are odd. 


Tf loops are not permitted, a realization such as that of Exercise 1-12 
is not generally possible. One must place more stringent requirements 
on the given set of integers. Tutte [The Factors of Graphs, Can. J. 
Math., 4: 314-828 (1952)] has developed necessary and sufficient condi- 
tions under which a given graph contains a subgraph having vertices of 
prescribed degrees. 


1-7 CONNECTIVITY 


A graph is said to be connected if every pair of distinct vertices are joined 
by at least one chain. Other graphs are said to be disconnected. For 
finite geometric graphs, these definitions conform with conventional 
usage; i.e., a finite geometric graph is connected in this graph-theoretic 
sense if and only if it is connected when considered as a point set. How- 
ever, this is not always the case for infinite graphs. Consider the geo- 
metric graph G = (V,£) in &, where V consists of all points with coor- 
dinates (x,y), = Oor1 and0 < y < 1, and where for each y the vertices 
(0,y) and (1,y) are joined by a straight edge. Viewed as a point set, @ 
consists of a unit square in &* and is connected. As a graph, however, it 
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is highly disconnected, since vertex (0,y) is joined to (1,y) but to no other 
vertex by a chain. 

As an alternative characterization of connectivity we have the follow- 
ing theorem: 

Theorem 1-4 A graph G = (V,E) is connected if and only if its 
vertices cannot be partitioned into two nonempty subsets V; and V¢ in 
such a way that both end points of every edge are in the same subset. 

Proof: Uf G is not connected, select a vertex v; and let V; consist of v; 
together with all vertices joined with »; by chains. Since @ is not con- 
nected, V;  V (why?), so the complementary set V2 = V — Vj is non- 
empty. Considering the way in which V; was defined, no edge joins a 
vertex in V; with one in V2, so a partitioning of the desired type has been 
obtained. 

Conversely, if such a partitioning exists, arbitrarily select v, « V, and 
vee Va. A chain joining v; and vp would necessarily contain at least one 
edge having end points in both V; and V2. Since no such edge exists, G 
is not connected, and the proof is complete. 

If G = (V,E) is any graph, consider the binary relation p defined 
between certain ordered pairs of vertices as follows: » p w if and only if 
either v = w or else v and w are joined by a chain. The relation p is 
clearly reflexive (v pv for every v), symmetric (vp w implies w pv), and 
transitive (up w whenever upv and vpw). Thus p is an equivalence 
relation, and it partitions V into uniquely determined equivalence classes 
of mutually related vertices. In Fig. 1-1, for example, these are {v2,v3,"}, 
{v1,v,}, and {v5}. Each equivalence class of vertices, together with those 
edges of H incident with these vertices, constitutes a connected subgraph 
called a component of G. A component G, of G is readily seen to be a 
maximal connected subgraph, in the sense that no proper supergraph of 
G;, is connected. 


Exercise 1-13 Prove that a connected graph remains connected after removal of 
an edge if and only if the edge is contained in some circuit. 


Exercise 1-14 Give an example of a connected graph which becomes discon- 
nected if any one edge is removed. (In view of Hxercise 1-13, the graph can have no 
circuits.) Such graphs, called trees, are considered in the next section. 


Exercise 1-15 Prove that a connected graph having k vertices must have at 
least k — 1 edges. 


Exercise 1-16 Prove that the set of all edges of a connected finite graph con- 
stitutes a simple circuit if and only if every vertex has degree 2. Formulate and prove 
a similar characterization of connected graphs whose edges constitute a simple chain. 


Exercise 1-17 Show that all the edges of a finite connected graph can be included 
in an appropriately chosen edge progression. (In Chap. 3 we shall determine the 
circumstances under which this can be done with a chain or circuit progression.) 
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1-8 TREES AND FORESTS 


A graph is said to be a tree if it is connected and has no circuits. More 
generally, a graph which is free of circuits and consists of k (connected) 
components is sometimes referred to as a forest of k trees. The concept 
of a tree plays a central role in many aspects of graph theory. A graph 
is a tree if and only if every pair of distinct vertices are joined by precisely 
one chain. (Connectivity implies the existence of at least one chain, 
while the absence of circuits implies the existence of at most one chain.) 

The removal of any one edge of a tree yields a disconnected graph, 
since the edge removed constituted the unique chain joining its end 
points. On the other hand, given any connected graph which is not a 
tree, it is possible to remove certain edges (namely, any edge included in a 
circuit) without destroying connectivity. Consequently, a tree may also 
be characterized as a minimal connected graph in the sense that it con- 
tains no proper subgraph which includes all of the vertices and is connected. 

If a tree 7' is a subgraph of a graph G, the edges of G which appear in 
T are called branches relative to T and the edges not included in 7’ are 
called chords relative to 7. If all vertices of G are included in 7, then T 
issaid to spanG. It is clear that only connected graphs contain spanning 
trees, and of these only trees contain unique spanning trees. 

In Fig. 1-7 two alternative spanning trees of the same graph are 
indicated by bold edges. The fact that each of these trees contains four 
edges is an instance of the following general property: 

Theorem 1-5 Every tree with n vertices has precisely n — 1 edges. 

Proof: Removal of any one edge divides a tree into two components, 
i.e., into a forest of two trees. (Why never more than two?) Similarly, 
removal of a second edge yields a forest of three trees. In general, after 
removal of any k — 1 edges, we are left with a forest of k trees. On the 
other hand, after removal of all edges, the forest clearly consists of n trees 
(each being an isolated vertex). It follows that the number of edges 
removed was necessarily n — 1. This completes the proof. 


(a) (b) 
Figure 1-7 
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T, T; qT, 


Figure 1-8 


By applying the preceding result to each tree of a forest, we obtain 
the following ‘“‘generalized” statement: 

Theorem 1-6 A forest of k trees which has n vertices has pre- 
cisely n — k edges. 

Given two trees which span the same graph, it is always possible to 
transform one into the other by constructing a sequence of spanning trees 
each of which has all but one edge in common with the preceding one. 
For example, consider the sequence 7, 7's, 73, 74, where T; is the tree 
shown in Fig. 1-7a and the remaining elements are shown in Fig. 1-8. 
(Note that 7's, Fig. 1-8, is the tree shown in Fig. 1-76.) 

The four trees constitute a ‘‘monotonic” transition from 7; to 7s, in 
the sense that each succeeding tree has one more edge in common with the 
final tree. In general, such a transition from one spanning tree 7’ to 
another 7” can always be achieved as follows: Let e; be any edge contained 
in T’ but not in 7. Then the (unique) chain in 7 which joins the end 
points of e; must contain at least one edge 2, not appearing in 7”, since 7” 
has no circuits. Construct the graph 7, which is identical with 7’ except 
that it includes e; and excludes @;. Then 7’; is necessarily also a spanning 
tree. (Why?) If 7; and 7” are not identical, repeat the process. If the 
underlying graph has k vertices, then each spanning tree has k — 1 edges 
and it will be necessary to generate at most k — 1 intermediate trees 
before T” is produced. 


1-9 DISCONNECTING SETS AND CUT - SETS 


In situations where the emphasis is on ways of separating sets of vertices 
from one another, the concept of a cut-set becomes important. For 
example, in the study of flows through networks, cross sections of the 
network separating flow origins from destinations are considered in order 
to find a restrictive cross section which constitutes a bottleneck. Such 
bottlenecks determine the flow capacity of the network as a whole. 
Before formally defining cut-sets, we introduce a more general con- 
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cept. IfG@ = (V,E£) isa connected graph and if F C Z, then F is calleda 
disconnecting set if and only if the subgraph G’ = (V, E — F) is not con- 
nected. Here H — F denotes the set of edges which are in # but not in F. 
Disconnecting sets always exist (if G has at least two vertices), since we 
may take = HE. (We assume that G has no loops, since they could play 
no role in connecting or disconnecting distinct vertices.) 

The dashed edges in Fig. 1-9 indicate two disconnecting sets in a 
graph G, the second of which is a subset of the first. The disconnecting 
set depicted in Fig. 1-9a divides the remaining subgraph into three com- 
ponents, one of which contains the vertices of the set W encircled in the 
figure. It is clear that in order to disconnect the graph, it would be suffi- 
cient to remove only those edges which join a vertex in W to one in 
W’ =V—W. These are the edges shown in Fig. 1-9b. 

In general, if G = (V,£) is connected and if V is partitioned into 
nonempty sets W and W’ = V — W, the set of edges joining W with W’ 
is called a cut-set. (For every W this set must be nonempty, since @ is 
connected.) For any given graph, the totality of eut-sets determined by 
various W’s forms a subclass of the class of all disconnecting sets and, 
moreover, every disconnecting set contains at least one cut-set as a subset. 

Of particular interest are disconnecting sets which are minimal in the 
sense that they contain no proper subsets which also disconnect the graph. 
Minimal disconnecting sets are called proper cut-sets. In view of the 
above observations, proper cut-sets are necessarily cut-sets. However, 
not every cut-set is proper. The one shown in Fig. 1-9b, for instance, is 
not. In general, if removal of a cut-set / divides the graph into three or 
more components, the cut-set cannot be proper. For if we return any one 
edge of F’, it can at most reunite two components, and the resulting 
graph still has at least two components. 

If G = (V,£) is a connected graph having at least two vertices and if 
ve V, then the set of all edges (excepting loops) incident with v is a cut- 
set corresponding to the partition {v},{V —v}. It is important to note 
the complementary nature of the concept of a spanning tree and that of a 


Figure 1-9 
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cut-set. The former represents a mini- 


% w mal set of edges which connects all 
Ve ve vertices of the graph, while the latter is 
w a minimal set of edges which discon- 
nects some vertices from others. The 
t following result combines these notions 
and is an immediate consequence of 
Figure 1-10 our definitions: 


Theorem 1-7 Every spanning 
tree has at least one edge in common with every cut-set of a graph. 

Suppose the vertices of a connected graph G are partitioned into non- 
empty sets W and W’ and that P = e1, e:,... , @ is a closed edge 
progression which starts at and returns to vp. Assume, without loss of 
generality, that vp ¢ W (see Fig. 1-10). In traversing P, we either remain 
in W at all times or else cross between W and W’ an even number of times. 
Formalizing this observation yields the following theorem. 

Theorem 1-8 Every closed edge progression in a connected graph 
has an even number (possibly zero) of elements in common with every 
cut-set. Consequently, every circuit has an even number of edges in 
common with every cut-set. 


_ Exercise 1-18 The disconnecting set shown in Fig. 1-9a contains two cut-sets 
which are different from (but not disjoint from) the one shown in Fig. 1-96. Identify 
these. 


Exercise 1-19 The cut-set shown in Fig. 1-9} is actually the union of disjoint 
proper cut-sets. Determine whether this is coincidental or a general property 
possessed by cut-sets which are not proper cut-sets. 


Exercise 1-20 Given a connected graph and two vertex sets W; and W», where 
W: # W: and W2 # Wi, is it possible for Wi and W: to generate the same cut-set? 
(Here W; denotes the set of all vertices of the graph which are not in W:.) 


Exercise 1-21 Connected graphs which contain cut-sets consisting of single edges 
can be characterized in terms of circuits. Give such a characterization. 


Exercise 1-22 Prove that a cut-set F in a connected graph G = (V,£) is proper 
if and only if G’ = (V, E — F) has precisely two components. 


1-10 SOME SPECIAL CLASSES OF GRAPHS 


Graphs can be classified in a variety of ways depending on the structural 
features which are used as the basis for classification. We have already 
seen graphs classified on the basis of connectivity (or lack of it) and on the 
basis of whether they are planar or nonplanar. We now introduce a 
number of other useful classifications. 

A graph is said to be simple if it has no loops and no parallel edges. 
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Note that we could also characterize Vo 
simple graphs as graphs having no 
cireuits composed of fewer than three 
edges. In many situations it is suf- 
ficient to restrict consideration to 
simple graphs. For example, connec- 
tivity (or lack of it) is not affected if 
loops and parallel edges are removed. 
Similarly, if the edges have nonnega- 
tive lengths associated with them, a 
search for shortest chains is not 
affected if loops, as well as all but the 
shortest edge in each set of parallel 
edges, are removed. 

A graph is said to be complete Figure 1-11 
if every two distinct vertices are 
adjacent, i.e., joined by an edge. Usually this term is applied to simple 
graphs. For such graphs there is essentially only one complete graph 
having a specified number of vertices, since all such graphs are iso- 
morphic. Consequently, we speak of the complete graph having k 
vertices. Figure 1-3, for example, represents the complete graph having 
five vertices, 

A graph is said to be bipartite if its vertices can be partitioned into 
two disjoint sets V,; and V2 in such a way that every edge has one end 
point in V; and the other in V2. More generally, a graph is k-partite if its 
vertices can be partitioned into k disjoint sets {Vi,V2, .. . , Vx} in such 
a way that no edge joins two vertices of the same set. It is often con- 
venient to emphasize the structural character of bipartite graphs by dis- 
playing the vertices of V; and V: in separate columns (or rows) as indi- 
cated in Fig. 1-11. 

A graph is said to be k-connected if every pair of distinct vertices » 
and w are joined by at least k chains which have no common vertices 
except, of course, v and w. For example, any simple circuit (except a 
loop) constitutes a 2-connected graph. Whenk = 1, this concept reduces 
to ordinary connectivity. 

If 6(v) = k for every vertex of a graph, the graph is said to be regular 
of degree k or simply k-regular. For example, Fig. 1-12 illustrates several 
connected 3-regular graphs. Note that the (simple) complete graph 
having k vertices is (k — 1)-regular. Also, according to Theorem 1-3, a 
k-regular graph has an even number of vertices, if k is odd. 


Exercise 1-23 What is the largest integer p such that the (simple) complete 
graph having q vertices is p-connected? 
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Exercise 1-24 Are any complete graphs bipartite? In general, what is the small- 
est integer p such that the complete graph having q vertices is p-partite? 


Figure 1-12 


Exercise 1-25 Figure 1-12 illustrates two connected graphs having four vertices 
which are not isomorphic although both are 3-regular. However, one of these is not a 
simple graph. Construct two simple graphs which are connected, have the same 
number of vertices, and are k-regular for the same k, but which are not isomorphic. 
Note that this cannot be done for k = 2. (Why not?) 


Exercise 1-26 Determine whether the following characterization of bipartite 
graphs is equivalent to the defining one: A graph is bipartite if and only if the totality 
of its edges forms a cut-set. 


Exercise 1-27 Formulate several specific application contexts in which the 
appropriate graph to consider is bipartite owing to the nature of the problem. (In 
general, such graphs arise when the vertices represent two distinct types of objects and 
when the edges serve to describe a relationship which by its nature holds only between 
unlike objects such as men and jobs or theatergoers and tickets.) 


Remark: In the preceding pages, the reader may have noticed the 
number of structural concepts which were related to some type of opti- 
mality. To recall a few of them: a simple chain is a minimal set of edges 
connecting two given vertices. A spanning tree is a maximal subgraph 
of a connected graph which contains no circuits and is a minimal subgraph 
which joins all vertices. A proper cut-set is a minimal set of edges whose 
removal disconnects a connected graph. A circuit (other than a loop) is 
a minimal 2-connected graph. Here minimal (maximal) means that no 
proper subset (superset) or subgraph (supergraph) has the property also. 

This theme is strongly present in the remaining pages also, both in the 
definitions of additional concepts and in the nature of many of the exercises 
which are considered. We shall be concerned, for example, with shortest 
and longest routes, maximal network flows, flows having minimal cost, 
partitioning the vertices of a graph into the fewest ‘‘independent” sets, 
ete. It may be of interest to note the various manifestations of optimality 
in the subsequent chapters. 


Exercise 1-28 List 10 activities with which you are familiar to which the con- 
cept of a graph is applicable. Point out some aspect of each which can be studied by 
means of the graph. 
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TWO 


BASIC CONCEPTS: 
DIRECTED GRAPHS 


2-1 INTRODUCTION 


In this chapter we introduce the basic concepts and terms asso- 
ciated with directed graphs, which are graphs having the addi- 
tional characteristic that every edge has been oriented or given a 
direction. The discussion is somewhat abbreviated, compared 
to that of Chap. 1, since many of the concepts are directly 
analogous to those presented for undirected graphs. On the 
other hand, some new concepts that by their nature have no 
undirected counterparts are introduced. 


2-2 DIRECTED GRAPHS 


In many contexts it is necessary to associate with each edge of a 
graph an orientation or direction. In the case of a geometric 
graph this can be interpreted as a direction of traversal of the 
edge, whereas in the case of an abstract graph it means merely 
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that the end points of each edge are distinguished from each other by 
ordering them. Thus the only structural difference between an undi- 
rected graph and a directed graph (also called digraph) is that the end 
points of an edge constitute an ordered rather than an unordered pair of 
vertices. 

In applications of graph theory the need for introducing directions 
on the edges arises in two ways. In some situations, the edges represent 
a relationship between pairs of vertices which is not symmetric. In the 
context of the structure of a system of city streets, for example, it becomes 
necessary to represent one-way streets. Or in the context of communica- 
tions being transmitted between men or machines, devices that are 
essentially unidirectional may be employed. In other situations it is 
necessary to introduce directions in order to establish a frame of reference 
and thus avoid ambiguities. For example, in a connection between 
electrical devices, it is necessary to designate one direction as “positive’’ 
in order to unambiguously describe current flow, even though the actual 
direction of flow may not be essentially restricted. 

Defined formally, a directed graph consists of a nonempty set V, a 
set A disjoint from V, and a mapping A of A intoV X V. The elements 
of V and A are respectively called vertices and arcs (or directed edges), 
and A is called the directed incidence mapping associated with the directed 
graph. If aeA and A(a) = (v,w), then are a is said to have » as its 
initial vertex and w as its terminal vertex. The symbolism a ~ (v,w) will 
be employed to convey the same information in contexts where A is not 
explicitly represented. (As in the undirected case, it is seldom necessary 
to symbolize the incidence mapping itself. Its existence is, however, 
fundamental to the concept of a directed graph.) For our purposes the 
number of vertices and ares is again assumed to be finite. 

Directed graphs will usually be denoted by D or (V,A,4), or by (V,A) 
when A is not used explicitly. Given a directed graph D = (V,A,4), the 
associated undirected graph is the graph G = (V,A,®) whose incidence 
mapping is defined as follows: &(a) = (v & w) whenever A(a) = (vyw). 
Thus G is obtained by disregarding the ordering of the end points of each 
arc. The structural terms introduced in Chap. 1 are also applied to a 
directed graph, with the understanding that the associated undirected 
graph is being described. For example, two ares of D are said to be parallel 
(adjacent) if the corresponding edges of the associated undirected graph 
are parallel (adjacent). 

Two directed graphs are said to be isomorphic if their associated 
undirected graphs are isomorphic in the undirected sense and if, also, 
the end points of each pair of corresponding ares are ordered the same way. 
Expressed formally, directed graphs D = (V,A,A) and D! = (V’,A’,A’) 
are said to be isomorphic if the elements of V and A can be placed in 1-1 
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(a) (b) (¢) 


Figure 2-1 
correspondence with those of V’ and A’, respectively, in such a way that 


A’(a’) = (v',w') 
if and only if 
A(a) = (v,w) 


where a’, v’, and w’ denote the images of a, v, and w, respectively. Thus 
the two directed graphs shown in Fig. 2-la and b are isomorphic. On the 
other hand, the directed graph shown in (¢) is not isomorphic with them, 
despite the fact that the associated undirected graphs of all three are 
isomorphic in the undirected sense. 

If D and D’ are isomorphic directed graphs and if D’ is a directed 
geometric graph, then D’ is called a geometric realization of D. ‘Theorems 
1-1 and 1-2, which characterize the realizability of undirected graphs, are 
equally applicable to directed graphs, since the possibility of making 
the necessary constructions in &* does not depend on whether or not the 
curves are to be considered as oriented. Similarly, a directed graph is 
planar (realizable in 8?) if and only if its associated undirected graph is 
planar. 


2-3 TERMINOLOGY DESCRIBING LOCAL STRUCTURE 


The following terms are useful for describing certain structural features of 
a directed graph which have no undirected counterpart. If a, ~ (v,w) 
and a) ~ (v,w), ares a, and a, are said to be strictly parallel. [Ifas ~ (w,v), 
then a, and a, are parallel but not strictly parallel.] If a ~ (v,w), are a 
is said to be directed away from vertex v and toward vertex w. Are a is 
also said to be positively incident with its initial vertex v and negatively 
incident with its terminal vertex w. The number of arcs which are 
positively incident with v is called the positive degree of v and denoted by 
éd+(v). The negative degree of v is defined analogously and denoted by 
6-(v). (A directed loop incident with v is considered to be both positively 
and negatively incident with v.) The signed degrees 6+(v) and 5-(v) are 
related to the unsigned degree 6(v) introduced earlier by the following 
equation: 


6(v) = S+(v) + 5-(v) 
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(For infinite graphs this relationship remains true if interpreted as an 
equation relating cardinal numbers.) Since every are is positively 
incident with one vertex and negatively incident with one vertex, it is 


evident that 
¥ ste) = Y o-@) = [AI 
veV veV 
where |A| denotes the number of ares of the graph, which is the directed 
counterpart of the relationship 
by 8(v) = 2|A| 
ve 
A directed graph is said to be simple if it has no strictly parallel ares 
and no loops. Note that if a simple directed graph has two parallel but 
oppositely oriented arcs, the associated undirected graph is not simple in 
the undirected sense, since it has parallel edges. The arcs of a simple 
directed graph can be represented, without ambiguity, by ordered pairs of 
vertices, since at most one arc joins a given pair of vertices in a specified 
direction. (Note that this is a specialization of, and not a deviation from, 
the concept of having a set V of vertices and a second set A of ares. No 
restriction was placed on the nature of the elements of A, and so in particu- 
lar they may be elements of V X V.) 


2-4 ARC PROGRESSIONS, PATHS, AND CYCLES 


An are progression of length n is a sequence of (not necessarily distinct) 
arcs 1, @, ... , @ such that for an appropriate sequence of n +1 
vertices vo, 01, . . . , Un We have a; > (vj-1,%;) fori = 1,2,...,n. For 
example, in Fig. 2-2 the sequence a, as, a5, a3 forms an are progression of 
length 4 with vs, v2, v2, v2, v1 a8 the corresponding sequence of vertices. 
An arc progression is said to be closed if vp = vp, and open if v9 # v,. In 
the latter case it is called an are progression joining vp and v, and, more 
specifically, leading from vo to v». 

It is clear that an open (closed) are 
progression in a directed graph deter- 
mines a corresponding open (closed) 
edge progression in the associated undi- 
rected graph. The converse is not gen- 
erally true, however. For example, in 
Fig. 2-2, the sequence aj, a2, ds deter- 
mines an open edge progression joining 
v2 and vs, but it fails to constitute an 
are progression for lack of consistent 
Figure 2-2 directions. 
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An arc progression in which no arc is repeated is called a path progres- 
sion or cycle progression, depending on whether it is open or closed. The 
corresponding set of arcs, without regard to sequencing, is called a path or 
cycle, respectively. If the vertices vo, v1, ... , Un are all distinct (in 
which case the ares are also distinct), the path progression or cycle pro- 
gression, as well as the corresponding path or cycle, is said to be simple. 
A directed graph is said to be cyclic if it contains at least one cycle, and 
acyclic otherwise. (Note that a loop is a special instance of a cycle.) 

Exercise 2-1 Show that the arcs of a path can be arranged in only one way to 
form a path progression if the path is simple. Show also that the converse is not true 
in general. 

Exercise 2-2 Prove that if there exist paths from v to vz and from v2 to vs, where 
v; # vs, then there exists a path from 2 to v3. 

Exercise 2-3 Prove that a nonsimple cycle can be partitioned into two or more 
simple cycles. 


Exercise 2-4 Prove that a nonsimple path from » to w can be partitioned into a 
simple path from v to w and one or more simple cycles. 


Exercise 2-5 Is it true in general that if 5+(v) > land 6-(v) > 1 for every vertex 
» of a finite directed graph, then every vertex is incident with at least one cycle? 


Exercise 2-6 Prove that if D contains a path from w to v and one from v to w, 
then D contains a cycle. However, give an example to show that there may be no 
eycle incident with both » and w. 


Exercise 2-7 Prove that if 6*(v) > 0 for every vertex of a directed graph D, 
then D is necessarily acyclic graph. [The same is true if 6-(v) > 0 for every vertex v.] 


Exercise 2-8 Show that there is no path in Fig. 2-3 which passes through each 
junction vertex once and only once, no matter how the edges are oriented. 


Figure 2-3 


; 28 Basic theory 


Exercise 2-9 Use an argument similar to that of Exercise 2-8 to prove that it is 
not possible for a rook which starts at the southwest square of a chessboard to end at 
the northeast square and pass through every square once and only once. Note that in 
each move a rook can travel any number of squares along the row or the column of the 
square on which it is located. 


2-5 STRONG CONNECTIVITY 


A directed graph is said to be strongly connected if, for every pair of distinct 
vertices v and w, there exists a path from v to w as well as one from w to 
v. It is evident that strong connectivity of a directed graph implies con- 
nectivity of the associated undirected graph. The converse, of course, is 
not generally true. In Fig. 2-4, D, is strongly connected but D» is not. 
In Chap. 3 we shall develop necessary and sufficient conditions for an 
undirected graph to have the property that an appropriate orientation of 
its edges will produce a strongly connected graph. 

A directed graph is said to be strongly k-connected if, for every pair 
of distinct vertices v and w, there exist at least k paths from v to w which 
have no vertices (and hence also no arcs) in common, except, of course, 
forvand w. Tora directed graph to be strongly k-connected, it is clearly 
necessary, but not sufficient, that the associated undirected graph be 
k-connected in the undirected sense. 

Consider a directed graph whose ares represent the (directed) com- 
munication channels between a group of individuals. If the graph is 
strongly connected, then it is possible for every individual to communicate 
with every other individual in at least one way (i.e., by means of at least 
one path), If the graph is strongly k-connected, then there are at least k 
disjoint communication paths from any one individual to any other indi- 
vidual. Thus one would have to disrupt communications at at least 
k points in order to deny communication between any specific pair of 
individuals completely. 


Exercise 2-10 Prove that a finite directed graph is strongly connected if and 
only if there exists a closed are progression which includes every arc at least once. 


D, Dz 


Figure 2-4 
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Exercise 2-11 Prove that a directed graph D = (V,A) is strongly connected if 
and only if, for every vertex partitioning {W, V — W}, the corresponding cut-set 
in the associated undirected graph includes at least one arc directed from W to V — W 
and at least one are directed from V — W to W. 


Exercise 2-12 Prove that if Ci, C2, . . . , Cy isa sequence of cycles in a directed 
graph D, where every two consecutive cycles have at least one common vertex, then 
the subgraph determined by the union of these cycles is strongly connected. 


Exercise 2-13 Let D be a directed graph having a vertex v such that every other 
vertex of D is joined to v by an are in each direction and such that there is at least one 
are which is not incident with y. Prove that for any integer k > 4 and for any two 
(not necessarily distinct) vertices u and w there exists an arc progression from u to w 
having precisely & terms. 


2-6 TREES AND CUT - SETS 


When the terms “‘tree,” “forest,” “disconnecting set,” ‘‘cut-set’’ 
and “proper cut-set”’ are used without qualification, it is to be understood 
that are directions are ignored and the associated undirected graph is 
being described. However, if we do take cognizance of are directions, 
several additional notions emerge. 

A directed graph is said to be a directed tree rooted at vo if (1) it forms 
a tree in the undirected sense and (2) the unique chain determined by vo 
and any other vertex w is in fact a path from vo to w. It is clear that a 
tree can be rooted at at most one vertex and that, in general, trees in 
directed graphs may not be directed trees. In Fig, 2-5a, for instance, 
there can be no directed tree which spans the graph. (Why not?) In 
Fig. 2-5b, on the other hand, the bold ares illustrate a directed tree rooted 
at vo which spans the directed graph shown. In Chap. 3 we shall develop 
a systematic procedure for finding a directed tree, rooted at a specified 
vertex v», whose (unique) paths to all other vertices represent shortest 
paths relative to a specified distance function defined on the arcs. 

Recall that cut-sets, and proper cut-sets in particular, are sets of 
edges which join W and W’, where { W,W’} is a partitioning of the vertices 


(a) (b) 
Figure 2-5 
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(a) (6) (c) 
Figure 2-6 


of a connected graph into two disjoint, nonempty sets. The deletion of a 
cut-set breaks the graph into two or more components. The deletion of 
a proper cut-set results in precisely two components. 

In a directed graph the ares of a cut-set can be classified into two 
groups: those directed from W to W’ and those directed from W’ to W. 
Removal of the former set disconnects all paths from W to W’, while 
removal of the latter disconnects all paths from W’ to W. We shall have 
occasion later (particularly in connection with network flows) to consider 
these two sets individually. The former will be called a W— W’ (read 
“W to W’”) cut-set, and the later a W’ > W cut-set. Figure 2-6b and ¢ 
exhibits these subsets of the cut-set shown in (a). 

Note that if D is strongly connected, then for every partition {W,W’} 
both directed cut-sets are necessarily nonempty. (See Exercise 2-11 con- 
cerning the converse of this statement.) 

Exercise 2-14 If the edges of a tree having n edges are randomly assigned direc- 


tions, what is the probability that the result will be a directed tree rooted at some 
vertex? (The answer depends on n, but not on the specific structure of the tree.) 


Exercise 2-15 Prove that if the edges of a complete simple graph are randomly 
assigned directions, the resulting directed graph necessarily contains a rooted directed 
tree which spans the graph. 


Exercise 2-16 Prove that if a simple graph is not complete, there is at least one 
way to direct its edges so that it does not contain a spanning tree rooted at any vertex. 


2-7 DIRECTED GRAPHS AND BINARY RELATIONS 


If p is a binary relation in a set S, the graph of p, denoted by D,, is the 
directed graph whose vertices are the elements of S and is such that there 
is an arc a= (s,t) if and only if spt. Conversely, if D = (V,A) is a 
directed graph having no strictly parallel ares, the binary relation asso- 
ciated with D, denoted by pp, is the binary relation in V such that v pp w 
if and only if a ~ (v,w) for some are a. 
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The directed graph shown in Fig. 2-7, for example, can be considered 
as representing a certain binary relation defined in a set of eight elements. 
In fact, any binary relation FR defined in a set S can be represented by an 
appropriate directed graph whose vertices correspond to the elements of S. 
The graph exhaustively catalogs the relation by enumerating all pairs of 
related elements. Thusinasense a directed graph is the ‘extensive form” 
of a relation, i.e., it is a complete tabulation of those pairs for which the 
relation holds. Given a particular directed graph, there may or may not 
be a characterization of the same binary relation given by a simple rule 
that avoids complete tabulation. The graph of Fig. 2-7, for instance, is 
actually an alternative representation of the inclusion relation C defined 
on the subsets of a set having three elements. This is apparent from Fig. 
2-8, where the vertices are identified with appropriate subsets of the set 
{x,y,z}. 

We wish to point out that it is generally not profitable to take a binary 
relation, originally given in some “closed form,” and generate the corre- 
sponding graph in order to study the properties of the relation. Nor is it 
often possible, when given a directed graph, to find a simple closed form 
for the binary relation which it represents. But at times this may be 
feasible, and the tools of one theory can be applied to the problems of the 
other. In any case, the conceptual relationship between the two notions 
should be kept in mind. Note that in one sense the notion of a directed 
graph is more general, in that varying numbers of strictly parallel arcs can 
be used to reflect a quantitative degree of relatedness. 

We shall borrow some of the terminology of binary relations to 
describe special classes of directed graphs having no strictly parallel edges. 
A reflexive graph is one having a loop at each vertex. A symmetric graph 
is one for which every arc a ~ (v,w) is matched by an are a’ = (w,v). A 


Figure 2-7 
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transitive graph is one for which the existence of arcs a ~ (v,w) and 
b = (w,z) implies the existence of an are c > (v,x). It follows from this 
that the existence of any path from v to another vertex y implies the exis- 
tence of an are from »v to y. 

The graph shown in Fig. 2-8 is transitive, but it fails to be either 
reflexive or symmetric. In fact, it is asymmetric, meaning that the 
existence of an arc a > (v,w) precludes the existence of an arc a’ ~ (w,v). 


Exercise 2-17 Referring to the definition of a transitive graph in the text, 
vertices v, w, and x were not required to be distinct. Use this fact to show that if D is 
transitive but not reflexive, then there exists a vertex which is not incident with any 
cycle. 


Exercise 2-18 A graph can be symmetric and transitive, yet fail to be reflexive. 
(a) Produce an example to show this. (b) What property characterizes all such 
graphs? 


Exercise 2-19 Let vertices correspond to the integers 0 to 12 and construct 
graphs characterizing the binary relation R, where yRw means: (a) v = w*; (b) v and w 
are congruent mod 4 (that is, » — w isa multiple of 4); (c) v and w are relatively prime. 


Exercise 2-20 A “family tree” T is a directed graph displaying the relation “is a 
parent of” defined for a selected group of individuals. Develop a systematic procedure 
for deriving from T the graph of the relation “‘is a great-grandparent of’’ defined for 
the same group of individuals. 


Remark: In the remainder of this book, except where otherwise 
noted, all graphs mentioned are assumed to be finite graphs. 


THREE 


PARTITIONS AND 
DISTANCES IN GRAPHS 


3-1 INTRODUCTION 


The topics developed in this chapter are related to two general 
themes. The first concerns partitioning the edges, ares, or verti- 
ces of a graph into sets of some specified structural type. For 
example, the famous problem of the seven bridges of Kénigsberg 
involves partitioning the edges of a particular graph into the 
smallest number (hopefully, 1) of cireuits or chains. The second 
theme is the measurement of distances in graphs. The deter- 
mination of a longest or “critical” path through a PERT network 
exemplifies this type of problem. Here one determines the 
shortest time within which some total project can be completed 
by finding a most time-consuming path of component activities. 


3-2 EDGE PARTITIONS 


The first general problem to be considered is that of partitioning 
the edges of a graph into the fewest disjoint subsets each of which 
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Figure 3-1 


is either a chain or a circuit (which need not besimple). Partitions of this 
type will be called coverings, and coverings involving the fewest possible 
chains and circuits will be called minimal coverings. Note that each 
individual edge is either a chain or (in the case of a loop) a circuit. Con- 
sequently, every graph has coverings. Moreover, since we are consider- 
ing finite graphs, clearly minimal coverings always exist. For example, 
Fig. 3-1 shows two different coverings of the same graph. Each consists 
of two chains and one circuit. In either of these, we could incorporate 
the circuit into one of the chains, thus forming a nonsimple ‘chats and 
reducing the number of sets in the covering by 1. 

Isolated vertices are clearly of no consequence when neers 
coverings. Also, a minimal covering for a disconnected graph can be 
obtained by finding a minimal covering for each component having edges. 
Consequently, we shall here concern ourselves only with connected graphs. 
Of particular interest are graphs which can be covered by a single chain or 
circuit, that is, graphs the totality of whose edges form a chain or a 
circuit. Such graphs are said to be wnicursal, inasmuch as it is 
possible to traverse all edges in a continuous manner without repeating 
any edge. 

Returning to the general problem, the characterization of minimal 
coverings depends essentially on whether or not any vertices have odd 
degree. (Recall that when such vertices exist, they occur in pairs.) 
Graphs—whether connected or not—all of whose vertices are of even 
degree are called Euler graphs. For connected Euler graphs, minimal 
coverings are characterized as follows: 

Theorem 3-1 If G = (V,E) is a connected Euler graph, then £ is 
a circuit and hence constitutes the unique minimal covering of G. 

Proof: If G has any loops, we remove them initially and consider the 
reduced graph, all of whose vertices are still of even degree. Starting at, 
any vertex v1, we proceed along any edge to its other end point, say, ve. 
Since v2 is of even degree, it is incident with a second edge, which leads us 
to a vertex v3. We can always proceed in this manner so long as v, # v1y 
since the evenness of all degrees assures us that we can always “leave” 
any other vertex after having “entered” it, using an edge not previously 
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traversed. Since there are a finite number of edges, ultimately we have 
Un = v1 for some n. The edges which have been traversed constitute a 
circuit. If every edge has been traversed, we are through. If not, we 
remove the traversed circuit and consider the resulting subgraph. Every 
vertex is still of even degree, (why?), and hence we can repeat the preced- 
ing construction, finding and removing a second circuit. After a finite 
number of steps, removal of the next circuit will leave no remaining edges. 
This set of circuits, together with the loops removed at the outset, con- 
stitutes a covering of G. But since G is connected, the union of these 
circuits, that is, #, is again a circuit. (Verify this in detail.) This 
completes the proof. 

A circuit which covers a graph is called an Huler circuit. Hence, 
according to Theorem 3-1, every connected Euler graph possesses an 
Euler circuit. Conversely, it is easily seen that a graph which possesses 
an Euler circuit is a connected Euler graph (provided that it has no isolated 
vertices). In contrast to the preceding result, minimal coverings of 
connected graphs having vertices of odd degree consist solely of chains. 
(Of course, these need not be simple chains.) More precisely, we have 
the following: 

Theorem 3-2 Given a connected graph G = (V,Z) having 2n 
vertices of odd degree, where n > 1, every minimal covering of G consists 
of n chains, each of which joins two vertices of odd degree. ; 

Proof: We initially augment G by adding n edges, each of which joins 
two vertices of odd degree, in such a way that every such vertex is incident 
with one of these edges. Let F denote this set of edges. Then 


@ =(V,EUP 


is a connected Euler graph, and hence HU Fisacircuit. When the edges 
of F (no two of which are adjacent) are removed, the circuit is broken into 
n chains, which cover G and which have as end points the 2n vertices of 
odd degree. Hence coverings of the type described in the statement of 
the theorem exist. 

Now consider an arbitrary covering of G. It must include at least n 
chains in order to generate 2n vertices of odd degree. (Why?) If it 
includes more than n chains, or any circuits, it cannot be minimal, since a 
covering with nm members has been shown to exist. If the covering con- 
sists of precisely n chains, they must be of the stated variety. (Why?) 
Hence only such sets of n chains constitute minimal coverings. This 
completes the proof. 

A characterization of unicursal graphs follows immediately from 
Theorems 3-1 and 3-2: 

Theorem 3-3 A connected graph is unicursal if and only if it has 
either 0 or exactly 2 vertices of odd degree. In the former case it is 
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covered by a circuit; in the latter 
case, by a chain joining the two 
vertices of odd degree. 

A classic problem concerning 
unicursal graphs is the Kénigsberg 
bridgeproblem. Itarose with regard 
to a promenade to be taken across 

Figure 3-2 seven bridges which interconnected 
two islands with the two banks of the 
Pregel river and with each other, in Kénigsberg. Letting 1; and v2 denote 
the two banks and 2 and v, the islands, the graph of Fig. 3-2 indicates the 
manner in which the bridges joined the land masses. The problem, 
studied in 1736 by L. Euler, was this: Is it possible to describe a con- 
tinuous route which traverses each bridge precisely once; i.e., is the graph 
of Fig. 3-2 unicursal? Since all four vertices have odd degree, it is not. 
According to Theorem 3-2, at least two chains are required to cover this 
graph. Figure 3-3 exhibits two minimal coverings. (In each case, one 
chain is shown dashed and the other solid.) 


Exercise 3-1 Find minimal coverings for the graphs shown in Figs. 1-2, 1-3, 1-5, 
and 1-6. 


Exercise 3-2 If a connected graph is not unicursal (and hence has at least four 
vertices of odd degree), prove that it has at least two distinct minimal coverings. 


Exercise 3-3 If a minimal covering of a connected graph G has k chains and/or 
circuits, with k > 1, then removal of an edge produces a subgraph whose minimal 
coverings may have k — 1, k, or k +1 elements. Give examples illustrating all 
three possibilities. 


Exercise 3-4 Prove that every cut-set of a connected Euler graph has at least 
two edges and that it necessarily has an even number of edges. 


3-3 ARC PARTITIONS 


We next consider an analogous general problem for directed graphs. In 
this case a covering means a partitioning of the ares into disjoint paths and 
cycles. The problem is formally the same: to characterize the minimal 
coverings of an arbitrary connected directed graph. 


Figure 3-3 
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It is convenient here to partition the vertices of an arbitrary directed 
graph D = (V,A) into disjoint sets 2, 8, and T as follows: 


R = {ve V|s+(x) = 8-(0)} 
S = {ve Vist(v) > 8-(v)} 
T = {ve V\d+(v) < 8-(v)} 


The symbols R, S, and T retain these meanings throughout this section. 

A directed graph is said to be pseudosymmetric at v if 6+(v) = 6-(v). 
A directed graph which is pseudosymmetric at every vertex is called a 
directed Euler graph. (Note that the associated undirected graph is then 
necessarily an Euler graph in the undirected sense.) Thus a directed 
Euler graph is one such that R = V. It was seen earlier (Chap. 2) that 
for any directed graph (V,A) we have 


» d+(v) = Dy 5-(v) 


It follows that S is nonempty if and only if 7 is nonempty and moreover 
that 

¥ (6r@) — HO) = Flew) — HC) 
if S and T are nonempty. For directed graphs, the characterization of 
minimal coverings depends essentially on whether or not S and T are 
empty. If they are empty, we have the following: 

Theorem 3-4 Given a directed Euler graph D = (V,A) which is 
connected, A is a cycle and hence constitutes the unique minimal covering 
of D. 

The proof, which is entirely analogous to that of Theorem 3-1, is 
omitted. For other directed graphs, we have the following: 

Theorem 3-5 If a directed graph D = (V,A) is connected but not 
pseudosymmetric, then every minimal covering of D consists of k paths 
each of which joins a vertex in S to one in 7, where 


k= ¥ (oto) — FO) = J lo-) — 50] 
ve ve 
Proof: As a first step, we augment D by adding k ares in such a way 
that the enlarged graph is a directed Euler graph. Specifically, for each 
vertex we 7 we add 6-(w) — 6+(w) arcs having w as initial vertex and 
some vertex of S as terminal vertex. We can do this in such a way that, 
for every vertex v eS, precisely 6+(v) — 6-(v) arcs have v as terminal 
vertex. Let B denote this set of k arcs, and let D’ = (V, A U B) 
Since D’ is a directed Euler graph, AU B is a cycle. If an are 
b, « B is removed, the remaining arcs form a path from some v e S to some 
weT. Ifa second are b:¢ B is now removed, this path is broken into 
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two paths, each joining a vertex in S to one in T. This would fail to be 
the case only if b; and be were adjacent arcs, but that is impossible. 
(Why?) Proceeding in this manner, after all k arcs are removed, we are 
left with k paths having the stated property, which form a covering of D. 

It remains for us to prove that a covering of this type is minimal and 
that no other coverings are minimal. To do this, we note that an arbi- 
trary covering includes at least 5+(v) — 6~(v) paths having v ¢ S as initial 
vertex. (There will be more than this number if and only if some paths 
of the covering have v as terminal vertex.) Hence, at least k paths origi- 
nate in S. For similar reasons, at least k paths terminate in 7, If the 
covering under consideration has more than k paths or any cycles, it can- 
not be minimal, since we have previously produced a covering having k 
members. If the covering has precisely k paths and no cycles, then the 
k paths originating in S and those terminating in 7’ are the same. In this 
case the covering has the stated characteristics, and the proof is complete. 

The following interesting class of problems, considered by Berge 
(Ref. 5 of Chap. 1), involves an application of Theorem 3-4. This applica- 
tion illustrates the fact that the selection of an appropriate graph to analyze 
is sometimes the crucial step in applying graph theory and that this graph 
is not always evident from the nature of the problem, 

Let the finite set X = {x1,a2, . . . ,v»} be regarded as an “alphabet,” 
whose elements are “letters.” A sequence 7, Y2, . - - , Ya, Where each 
yi is a letter, will be regarded as a ‘‘g-letter word” in the alphabet X. A 
eyclic sequence of letters z1, 22, . . . , 2, is a sequence of letters together 
with the convention that 2: is regarded as the successor of z,(so that, for 
example, z,-1, 2;, 21, 22 is considered to be a sequence of four consecutive 
letters of the cyclic sequence). 

The problem to be considered is this: Given an alphabet X having p 
letters and given a positive integer q, find a shortest cyclic sequence S of 
letters such that every q-letter word appears at least once as q consecutive 
letters of S. Note that there are precisely p? distinct q-letter words which 
are formable in X, so S necessarily contains at most q - p% terms (since we 
could simply itemize all words, one after another) and at least p* terms. 
We shall see that for p > 2 and q > 2 the latter bound is precise; i.e., we 
shall produce a cyclic sequence containing every g-letter word and having 
only p? terms. 

We construct the directed graph D whose vertices are the p?-* 
different (¢ — 1)-letter words formable from X and whose ares are the 
p* different q-letter words formable from X. The initial and terminal 
vertices of the arc yi, yz, . . . , yx are taken to be yi, yo, . . . , Ye-1 and 
Y2; Ys, - » » > Yk, respectively. It is readily verified that D is a connected 
Euler graph and consequently has at least one Euler cycle, which 
necessarily has precisely p* terms. 
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Figure 3-4 illustrates the construction when p = 3 (the specific 
alphabet being X = {a,b,c}) and g= 38. The are from ba to ac, for 
example, is the are bac. (For simplicity, we have omitted commas 
between letters.) The arcs are numbered 1 to 27 to indicate an Euler 


cycle. 
In general, haying found an Huler cycle, we form the cyclic sequence 
(having p* terms) 21, 22, . . . , 2, Where 2; is the first letter of the ith are 


in the Euler cycle. In the example, this sequence (without commas) is 
aabcababbbcbeccacaaacbacchb 


The cyclic sequence S constructed in this manner has the required char- 
acteristics. To see this, note that the g—1 arcs following any arc 


Yr, Y2, + + « » Yq in the Huler cycle necessarily have the form 
Yoys * * * Yo-WYorr 
Yaa * * Yohite 


Figure 3-4 
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S are 
Ya * Ye 


which is the are (that is, q-letter word) with which we arbitrarily started. 
Thus S clearly contains every q-letter word and is as short as possible. 
S also is a solution to the problem of finding a longest eyclic sequence in 
which no g-letter word occurs more than once. (Why?) 

Note that a shortest cyclic sequence containing all q-letter words has 
p® terms also in the cases when q = 1 and/or p = 1. (If q = 1, any 
sequence itemizing the p letters will do; while if p = 1, the single letter of 
the alphabet constitutes the appropriate sequence.) In particular, the 
above construction is trivially applicable when p = 1 and q > 1. 


Exercise 3-5 Construct the directed graph and find an Euler cycle and corre- 
sponding cyclic sequence for the case when X = {a,b} andg = 4. 


Exercise 3-6 Repeat Exercise 3-5 for the case when X = {a,b,c,d} and q = 2. 
Exercise 3-7 Whatis the nature of the graph when X = {a} and q > 1? 


Exercise 3-8 Prove that if every g-letter word occurs precisely once in a cyclic, 
sequence, then every (g — 1)-letter word occurs precisely p times and, in general, 
every (q — k)-letter word occurs precisely p* times, fork = 1,2)..0.5 5g mae 


By use of Theorems 3-4 and 3-5 and by interpretation of unicursal 
to mean capable of being covered by a single path or cycle, the follow- 
ing theorem characterizes unicursal directed graphs: 

Theorem 3-6 Given a directed graph D = (V,A) which is con- 
nected, D is unicursal if and only if it is either a directed Euler graph or is 
such that |S| = |7| = 1 and 6+() — 6-(v) = 1 for the single vertex ve 8. 

Theorems 3-5 and 3-6 play a basic role in connection with the network 
flows to be discussed later. In this connection, the following specialized 
theorem will also be useful: 

Theorem 3-7 Let D = (V,A) be a directed graph which is such 
that |S| = |T| =1. Let s and ¢ denote the single vertices in S and T, 
respectively, and let 


k = 6+(s) — 6-(s) = 6-(t) — oH) 


Then D can be covered by k simple paths from s to ¢, together (possibly) 
with some cycles. 

Proof: If D is connected, Theorem 3-5 asserts that D can be covered 
by k paths of the stated variety. We have seen earlier that any nonsimple 
path can be partitioned into a simple path joining the same vertices, 
together with one or more simple cycles. If D is not connected, let Du, 
D2, . . . , Dn denote its components. Vertices s and ¢ are necessarily in 
the same component. (Why?) This component can be covered by k 
simple paths from s to ¢ together possibly with some cycles, as we have 
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just seen. Each of the remaining components is a directed Euler graph 
which is connected. As such, its ares form a cycle. If this cycle is non- 
simple, it can be partitioned into simple cycles. This completes the proof. 


Exercise 3-9 Find minimal coverings for the directed graphs in Figs. 2-2, 2-4, 
and 2-7. 


Exercise 3-10 Given an Euler graph, show that it is always possible to direct its 
edges in such a way that a directed Euler graph results. 


Exercise 3-11 If a directed Euler graph is connected, is it necessarily strongly 
connected? 


Exercise 3-12 Give a detailed proof of Theorem 3-4. 


A somewhat related problem is the following: Under what conditions 
can the edges of an undirected graph be oriented in such a way that the 
resulting directed graph is strongly connected? For example, when can 
all the streets of a city be restricted to one-way traffic and still permit 
travel from any point to any other point in the city? Such graphs are 
characterized in the following theorem :"+ 

Theorem 3-8 A connected graph G = (V,£) is orientable (in the 
above sense) if and only if every edge of G is contained in at least one 
circuit. 

Proof: Note first that if the theorem is true for simple graphs, it is 
true for all graphs We therefore assume that the graph is simple. It is 
clear that G cannot be orientable if some edge e ~ (v & w) is not contained 
in any circuit; for such an edge represents the only simple chain in the 
graph joining v and w, and as soon as ¢ is oriented, one of these vertices 
will not be accessible from the other. 

To prove the converse, we shall describe a construction which pro- 
duces appropriate orientations. To start, take any simple circuit 
{e1,e2, . . . én} incident with vertices {vi,v2, . . . wn}. If we orient the 
circuit in either direction, we produce a strongly connected subgraph 
consisting of these n ares and vertices. 

Assume that at least one other vertex w besides 1, v2, . . . , Un exists. 
(Otherwise, we are through.) Then at least one such w is adjacent to one 
of the v,’s. Let e be such that e~ (v;& w). By assumption, e is con- 
tained in some simple circuit C. Orient those edges of C which have not 
already been oriented; to do so, use the orientation induced by traversing 
C, in either direction. The resulting, enlarged directed graph is also 
strongly connected (verify this in detail) and contains at least one new 
vertex, namely, w. If not all vertices are included, a new w is selected 
and the process is repeated until all vertices are included. 


} Superscript reference numbers pertain to the list of references at the end of the 
chapter. 
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3-4 HAMILTONIAN CHAINS AND 
CIRCUITS 


We have already considered the prob- 
lem of determining whether or not a 
given finite graph is unicursal. It is 
natural also to formulate the follow- 
ing superficially related problem: 
Under what circumstances does a 
finite connected graph contain a chain 
or circuit which is incident with every 
vertex? If, in addition, the chain or 
circuit is required to be simple, it is 
correspondingly called a hamiltonian 
Figure 3-5 chain (often referred to as a hamil- 
tonian line) or a hamiltonian circuit. 

If a graph possesses a hamiltonian circuit S, then it clearly possesses a 
hamiltonian chain. In contrast, the bipartite graph of Fig. 3-5 possesses 
several hamiltonian chains but no hamiltonian circuit. And, of course, 
many graphs possess neither. For some special classes of finite graphs, the 
existence or nonexistence of hamiltonian chains or circuits is easily settled. 
For example, a tree cannot possess a hamiltonian chain unless the tree 
itself constitutes a chain. (The details are left to the reader.) Simi- 
larly, Fig. 3-5 exemplifies the general fact that no bipartite graph having 
an odd number of vertices possesses a hamiltonian circuit. (Every 
simple circuit in a bipartite graph has an even number of edges and hence 
is incident with an even number of vertices.) On the other hand, every 
complete graph clearly has numerous hamiltonian circuits. 

While partial results for special classes of graphs exist, the problem 
has not been adequately settled in general. For instance, there is no 
efficient constructive procedure for finding a hamiltonian chain in an 
arbitrary graph or else providing information which proves that none 
exists. 

The celebrated mathematician Hamilton introduced a commercial 
game, whose object was to find a hamiltonian circuit in the graph deter- 
mined by the vertices and edges of a certain polyhedron. An account of 
this may be found in Ref. 20 of Chap, 1. 

Remark: The problem of finding a hamiltonian circuit can be viewed 
as a special case of the following problem: In a graph G, each of whose 
edges e has a positive length L(e), find a simple circuit whose total length 
is maximal. By setting L(e) = 1 for every edge, the length of a simple 
circuit is simply the number of edges which it contains and, equivalently, 
the number of vertices with which it is incident. If G has n vertices, then 


Partitions and distances in graphs 43 


G has a hamiltonian circuit if and only if a longest simple circuit has 
length n. 

Exercise 3-13 The planar graph shown in Fig. 3-6 represents the vertices and 
edges of a dodecahedron. (A dodecahedron is a solid having 20 vertices and 12 
pentagonal faces. ‘The outside region also represents a face.) Find a hamiltonian 
circuit for this graph, his is the graph involved in Hamilton’s game mentioned 
earlier. 


Figure 3-6 Figure 3-7 


Exercise 3-14 The graph shown in Fig. 3-7 is obtained from that of Fig. 3-6 by 
rotating the outer pentagon 36° and modifying the end points of five edges (such as 
edge e) accordingly. Note that this graph also has 20 vertices and 12 pentagonal 
faces (including the outside region). Prove, however, that there exist no hamiltonian 
circuits in this graph. (Hint: Exploit the five vertices of degree 2). 


Exercise 3-15 The graph G associated with a rhombic dodecahedron has eight 
vertices of degree 3 and six of degree 4. Moreover, it is bipartite: no two vertices of 
the same degree are adjacent. Prove that @ has no hamiltonian chain (and hence also 
no hamiltonian circuit).!* 


Exercise 3-16 Use Exercise 3-15 as a guide to formulate a more general necessary 
condition for a connected bipartite graph to have a hamiltonian circuit or chain, 

We turn now to a particularly interesting topic related to the concept 
of hamiltonian circuits. An undirected graph G is hypohamiltonian 
(HH for brevity) if it has no hamiltonian circuit but every subgraph of 
immediately smaller order has such a circuit. The problem is to find 
an HH graph of minimum order and show that the solution is unique. 
We assume that the graph has n vertices and m edges, and is a simple 
graph. Throughout this discussion, d(v) is synonymous with 6(v). 

Lemma 3-9 If G@ is HH, then n > 3. 

Lemma 3-10 If@ is HH, d(v) > 3 for every v. 

Proof: Every vertex is in a simple circuit of length n — 1, and hence 
d(v) > 2. If w is adjacent to v, then in the subgraph obtained by sup- 
pressing w we also have d(v) > 2. Thus d(v) > 3. 
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v Lemma 3-11 If G is HH and if w 

x and x are consecutive vertices of a simple 
cireuit of length n — 1 in the graph in 
which a vertex v is suppressed, then v is 
not adjacent to both w and a, 

Proof: Otherwise the insertion of v 
yields a hamiltonian circuit for the 
entire graph. 

Lemma 3-12 If G is HH, then 
d(v) < (n — 1)/2 for every v. 

Proof: Follows from Lemma 3-11. 

Lemma 3-13 If G is regular of 
constant degree d(v) = d, then d-n = 2m. 

Figure 3-8 Lemma 3-14 If v, w, «, w’, «’ are 
distinet vertices of G such that v is ad- 
jacent to w and w’, and x is adjacent to «’, and if (w,xz) and (w’,z’) 
are two edges of the same hamiltonian circuit of the graph obtained by 
suppressing v, then G has a hamiltonian circuit. 
Proof: The circuit is indicated in Fig. 3-8. 
Theorem 3-15 IfG is HH, then n > 7. 
Proof: Follows from Lemmas 3-10 and 3-12. 
Theorem 3-16 IfG is HH, then n #7. 
Proof: By Lemmas 3-10 and 3-12, n = 7 implies that d(v) = 3 for 
every v, contradicting Lemma 3-13. 
Theorem 3-17 IfG is HH, then n # 8. 
Proof: By Lemmas 3-10 and 3-12, n = 8 implies d(v) = 3. If G is 
HH of order 8, by Lemma 3-11 it is one of two graphs, Figs. 3-9 and 3-10, 
each of which has a hamiltonian circuit by Lemma 3-14. 


2! w’ 


Figure 3-9 Figure 3-10 
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5 
Figure 3-11 Figure 3-12 


Theorem 3-18 IfG is HH, thenn ¥ 9. 

Proof: By Lemma 3-13, if G is HH of order 9, it cannot be regular of 
degree 3; and by Lemma 3-12, it has at least one vertex of degree 4. By 
Lemma 3-11, it has the partial graph of Fig. 3-11. Vertex 2 must be 
adjacent to at least one of the vertices 4 to 8, since d(2) > 3. But the 
edges 2-4, 2-6, and 2-8 each yield a hamiltonian circuit (by Lemma 3-14). 
If 2 is adjacent to 5, then 8, which by the same reason cannot be adjacent 
except to 3 or 5, is not adjacent to 5 without having d(5) > 4 contrary to 
Lemma 3-12. Thus 8 is adjacent to 3. Similarly, 6 is adjacent to 1 and 
4 to 7, and there can be no other edges since the vertices 9, 1, 3, 5, 7 are 
saturated. The same figure (see Fig. 3-12) would be obtained if 2 were 
adjacent to 7. Note that every edge joins two vertices whose numbers 
have opposite parity. Thus the graph obtained from @ by suppressing 
vertex 1 has no hamiltonian circuit, and therefore G cannot be HH. 

Theorem 3-19 IfG is HH of order 10, it is homogeneous of degree 3. 

Proof: Let some vertex have degree 4. G admits the partial graph of 
Fig. 3-13. By Lemma 3-14 each of 2, 4, 5, 7, 9 cannot be adjacent except 
to one or two of 1, 3, 6, 8. Since each of the latter cannot be adjacent 
except to at most one of the former vertices, we have a contradiction. 

Theorem 3-20 The graph of Fig. 3-14 is HH. 

Proof: Suppressing vertex 1, we have the circuit (0,4,3,2,6,5,9,8,7,10) ; 
and suppressing vertex 2, we have the circuit (0,1,9,8,3,4,5,6,7,10). The 
other cases are obtained by symmetry. The graph has no hamiltonian 
circuit. Thus the maximal elementary chains which start with 10, 1, 2 
are (6,5,4,3,8,7), (6,5,4,3,8,9), (6,5,9,8,3,4), (6,5,9,8,7), (6,7,8,3,4,5,9), 
(6,7,8,9,5,4,3), (3,4,5,6,7,8,9), (3,4,5,9,8,7,6), (3,8,7,6,5,4), (3,8,7,6,5,9), 
(3,8,9,5,4), (3,8,9,5,6,7) and none of them gives a hamiltonian circuit. 
By symmetry, this is the case for all other chains. 
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Figure 3-13 Figure 3-14 


Theorem 3-21 IfG is HH of order 10, it is isomorphic to the graph 
of Theorem 3-20. 

Proof: By numbering the vertices 1, 2, . . . , 9 of the circuit in the 
graph without vertex 10 there are three types of graphs possible for G: 


1. The type in which 10 is adjacent to 1, 4,7, which by Lemma 3-14 
and Theorem 3-19 gives the graph of Fig. 3-13 or the graph of Fig. 3-15. 
But the latter has the hamiltonian circuit (10,1,2,9,8,7,6,5,3,4,10). 

2. That in which 10 is adjacent to 1, 3, 6, which leads to the graph in 
Fig. 3-16 in which we have the hamiltonian circuit (10,6,7,5,4,3,2,8,9,1,10). 

3. That in which 10 is adjacent to 1, 3, 5, which leads to the graphs 
shown in Figs. 3-17 and 3-18, which respectively have the hamiltonian 
circuits (10,1,2,3,4,7,8,9,6,5,10) and (10,1,2,7,8,9,6,5,4,3,10). 


Figure 3-15 Figure 3-16 
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Figure 3-17 Figure 3-18 


The concepts of hamiltonian chains and circuits can be extended in a 
straightforward manner to directed graphs. Specifically, a simple path 
or cycle which is incident with every vertex is called a hamiltonian path or 
hamiltonian cycle, respectively. (A hamiltonian path is sometimes referred 
to as a directed hamiltonian line.) Hamiltonian paths and cycles in a 
directed graph determine corresponding hamiltonian chains and circuits, 
respectively, in the associated undirected graph. They require, in addi- 
tion, consistency of are orientations; hence we can expect their existence 
to bestill more rare. We shall indicate several results which are applicable 
to a special class of directed graphs. 

A directed graph will be said to be d-complete if every pair of distinct: 
vertices are joined by precisely one arc. Thus one can obtain a d-complete 
graph by taking a complete undirected graph (which has no parallel 
edges) and arbitrarily directing its edges. In contrast to the fact that 
every complete graph has numerous hamiltonian circuits, a d-complete 
graph may have no hamiltonian cycles. For example, if there exists a 
vertex v for which either 5+(v) = 0 or 5-(v) = 0, then there can exist no 
hamiltonian cycles. More generally, it is clear that the existence of a 
hamiltonian cycle implies that the graph is strongly connected. (Why?) 

If we do not insist on returning to the starting vertex (i.e., if we will 
settle for a hamiltonian path), the prospects are a good deal brighter. 
Kénig has proved that every d-complete graph @ contains at least one 
hamiltonian path. For a proof, see pages 30 and 31 of Kénig’s book 
(Ref.16 of Chap.1). Rédei!*has shown that there are in fact anodd num- 
ber of hamiltonian paths. Camion? investigated uniqueness and obtained 
the following theorem: 

Theorem 3-22. A d-complete graph G contains a unique hamil- 
tonian path if and only if @ contains no cycles. 
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Proof: If G contains two different hamiltonian paths, they determine 
two permutations of the vertices of G, and these differ by at least one 
inversion. Hence there are two vertices each of which is joined to the 
other by a path. This implies the existence of a cycle. On the other 
hand, if a cycle exists, then the method of construction employed by 
K@énig and described in his book, cited earlier, has options which lead to 
different hamiltonian paths. 

A directed graph is said to be total if every pair of distinct vertices 
is joined in at least one direction by a path. Note that a d-complete 
graph is a special type of total graph in which the above paths are single 
ares. All strongly connected graphs are also total. The following result, 
also due to Camion and stated here without proof, applies to all total 
graphs. 

Theorem 3-23 A necessary and sufficient condition that a directed 
graph without cycles have precisely one hamiltonian path is that it be total. 

An interesting related problem seeking the shortest hamiltonian 
circuit is that of a traveling saleman who must pass through each of n 
cities (every two of which are connected) precisely once and return to his 
starting point. Thus the salesman must travel the shortest distance. 
It is clear that computing the length of all hamiltonian circuits to deter- 
mine the shortest one involves (n — 1)!/2 possible circuits, which is an 
astronomical number for large n. The problem is to find a more conven- 
ient algorithm for the solution of the problem. No such general algorithm 
exists as yet. For a number of computational algorithms dealing with 
this problem see Refs. 3, 7, and 9. 


Exercise 3-17 Let dj; be the distance between the ith and jth cities. Give a 
matrix interpretation of the problem. One is tempted to select the smallest distance 
connecting any pair of cities, cross out the corresponding row and column, and repeat 
the process. But this does not in general yield an optimum solution. Now take five 
cities, set up the matrix assigning distances, compute an optimal hamiltonian circuit 
by exhausting the possibilities, and compare with the plausible procedure mentioned 
above. Show that in some cases the solution does not contain the shortest edge and 
may contain a longest one. 


In this regard it is interesting to note that the shortest closed route 
connecting n points in a plane not all on a straight line is a simple polygon. 
In particular, if the convex hull of the set contains none of the n points 
in its interior, then its boundary is the shortest closed route. (Thus there 
need not be any crossing of routes in the corresponding traveling sales- 
man problem.) To prove this fact, we connect the n points by any 
closed path and note that a shorter path is obtained by connecting the 
points in the same order by straight line segments with the given points 
as the only vertices. If the segments v,v;41 and vj,;,1 intersect in a point 
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v, we suppose that the path followed is»; vi41 + + * Yor * + 0 Ifvis 
not one of the given points, then vp + * * viy1¥j41 * * * Us is a shorter 
polygonal path which does not contain the intersection v. If, on the 
other hand, »v is in the set of given points, then vvvj * + + Vip °° * Vi, 
is a polygonal path which does not contain » as an intersection point. 
(A sketch will clarify the argument.) 

For the interested reader, we present an interesting and simple 
approach to the traveling salesman problem due to P. C. Ryan of the 
Naval Research Laboratory. This approach is not any more efficient 
than other algorithms, but it provides some insight. 

Recall that the problem is to find a shortest hamiltonian circuit 
when each edge of the graph is assigned a length. The graph need not 
be complete, in which case the existence of one or more hamiltonian 
circuits is not assured, and consequently the problem may have no solu- 
tion. If parallel edges are present, we can clearly disregard all except 
the shortest of each parallel group. The present algorithm will either 
find a shortest hamiltonian circuit or indicate that none exists—in either 
case in a finite number of steps. 

We will assume that the graph under consideration is planar, and 
the fact (established in Chap. 4) that a simple planar graph has at least 
one vertex of degree 5 or smaller. In addition, the edge lengths will be 
assured positive. 

Starting with an arbitrary vertex v; of minimum degree d < 5, it is 


possible to determine () < 10 distinct subgraphs, each consisting of all 


vertices of the original graph, all edges that are not incident with v1, and 
two of the d edges that are incident with »,. Clearly, the problem of 
finding all hamiltonian circuits of the original graph is equivalent to find- 
ing all hamiltonian circuits in these subgraphs, each of which has a vertex 
(v:) of degree 2. 

Now consider one of these subgraphs G; = (V,E;,®,), where |V| = n, 
©; = 6/2; (the restriction of ® to Z,), and degree v; = 2. Starting with 
the edge e; incident with »; and the adjacent vertex of least degree (if both 
adjacent vertices are of the same degree, either will suffice), we construct 
a hamiltonian circuit C, as the end result of a sequence of n simple 
chains Cy = {e1, . . . ex}, where e; ~ (0; & vj41) forj = 1,... ,k, with 
the understanding that v.41 = v1. (This notation is meant to imply that 
the edges and vertices are labeled sequentially as they occur in the con- 
struction of the chain.) An edge e, may be adjoined to C;_1 if the follow- 
ing conditions are satisfied: 

1. v4: #2; (G = 1, ... , k), unless k = n, in which case the single 
exception noted above applies. 
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2. Cy = Cy-1U {ex} does not separate the remaining graph; in the 
sense that every pair of vertices vp, vg € {vk42, . . . » Un¢1 = V1} can be 
connected by a chain none of whose edges is incident with some vertex in 
{ve, see » Mepi}. 


Conditions 1 and 2 ensure that the final chain will be a simple circuit. 

It is likely that at some stage more than one edge will qualify for 
addition to the chain at the same vertex. When this occurs, an arbitrary 
incident edge of least length is selected and unselected qualified edges 
are noted. Later each of the unselected edges must be investigated. 

Tf no edge satisfying conditions 1 and 2 can be found at the terminal 
vertex 041 of a chain C,(& <n), the chain is not part of a hamiltonian 
circuit and is abandoned. At this point, a return is made to the most 
recent arbitrary selection and an alternate chain is investigated. When 
all such selections have been checked, any hamiltonian circuits which 
exist will have been found. Following this procedure for each subgraph 
G; yields all hamiltonian circuits. 

Let s; be the length of the shortest edge in the 7th subgraph [that is, 
8; = ee A(e), where d is the length function defined on EJ. After the 


first ‘asuiitoniias circuit C,, has been found (in any subgraph), its length 
|Cn| may be calculated. Let Cj, denote any subsequent chain under con- 
struction in the ith subgraph, with length |C;|. In order for Cj, to result 
in a hamiltonian circuit of length shorter than C,, we must have 


ICy| + (n — k)s, < [Cal 


at every step of the process (i.e., for every value of k= 1,... , m). 
This inequality provides another criterion for rejecting edges which are 
under consideration for addition to the chain. 

To illustrate, we apply the method to the graph G of Fig. 3-19. The 
integer labels represent the lengths associated with the edges. Figure 
3-20a, b, and ¢ gives the three subgraphs that result from the (semi- 
arbitrary) choice of »; and the corresponding values of s;. 

Figure 3-21 illustrates the series of steps which are 
2 required to construct the first hamiltonian circuit (in @;). 
The edges shown by heavy lines are the circuit edges. 
oes Condition 2 of the method is equivalent to requiring 
a that the graph obtained by deleting from G, the heavy 
1 5 and dotted edges and the circled heavy vertices (at each 
ca step) be connected. Numbers in parentheses at a ver- 
7 tex refer to the number of unselected (but qualified) 

Figure 3-19 edges remaining to be investigated there. 
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G: (8, =) G: 9 (8=1) G: 9 (83=1) 


+ Length = 13 
5 


Figure 3-21 


Returning to consider the alternative at step 2 (above), the situation 
is as depicted in Fig. 3-22. Note that the edge of length 5 is not a permissi- 
ble alternative in step 2 (Fig. 3-22), since we already have a complete 
hamiltonian circuit of length 13. Since no further edges may be added to 
it, this chain is abandoned and G; may be dropped from consideration. 

Figures 3-23 and 3-24 illustrate the steps necessary for full considera- 
tion of G, and G3. Note that the unused alternative in the second sketch 
of Fig. 3-23 becomes an illegitimate alternative (by the chain length con- 
dition) when the hamiltonian circuit of length 10 is found (see Fig. 3-23). 

Although this method works well for small-scale graphs like the one 
of the example and quite possibly would be acceptable (if done graphically 
as above) for somewhat larger-scale planar graphs, it would not be easy to 
mechanize, especially since condition 2 (which is easy to visualize graphi- 
cally) would be difficult to formulate efficiently on a computer. 


Exercise 3-18 In the directed graph of Fig. 3-25 start at A, visit each intersec- 
tion only once, and return to A. There is a route. Find it.4 


Figure 3-22 
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Figure 3-23 


D 
Figure 3-24 Figure 3-25 
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3-5 VERTEX PARTITIONS 


A set of vertices is said to be independent if it contains no two adjacent 
vertices. In particular, a single vertex constitutes an independent set. 
One sometimes encounters the problem of finding a largest independent 
set of vertices in a given graph. 

For example, suppose that, we wish to form the largest possible 
committee from a group of individuals certain pairs of whom cannot be 
jointly assigned. This problem is of the above type, where the vertices of 
the graph represent the various individuals and the edges indicate con- 
flicting pairs. We may as well assume that the graph determined in this 
munner is connected. (If it is not, we can select a maximal subcommittee 
in each component and combine the subcommittees to solve the problem.) 

A reasonable approach would be to start with one individual and 
successively add others, at each stage selecting one who does not conflict 
with any previously selected individual. Such a procedure will ultimately 
yield a maximal independent set W, in the sense that W is not a subset of 
any larger independent set. We cannot be assured, however, that W will 
have the largest possible number of vertices (the maximum maximorum), 
since the size of W depends on the particular vertices selected at each 
stage of the process. 

As an extreme case consider the graph shown in Fig. 3-26. If we 
start with any vertex other than 7; and at each stage select any vertex 
other than »;, a maximal independent set W, that includes all but one 
vertex of the graph is ultimately obtained. On the other hand, if we have 
the misfortune to initially select v1, we are through, and we have discovered 
a maximal (though very small) independent set W» consisting of a single 
vertex. In this illustration, the complement of one maximal independent 
set is another maximal independent set—a coincidence. In Fig. 3-27 the 
independent set W = {vs,vs} is maximal, but its complement is not 
independent. 
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Figure 3-27 


A maximal independent set W has the property that it dominates the 
graph in the sense that every vertex is either a member of W or is adjacent 
to a member of W. For if an independent set does not dominate the 
graph, there exists at least one vertex which is neither in the set nor adja- 
cent to it. We can adjoin this vertex to the set without destroying inde- 
pendence. ‘Thus the set is not maximal. 

A set of vertices having this dominance property is called a dominating 
set, whether it is independent or not. In view of the preceding remarks, 
the number of vertices in a largest independent set is at least as great as 
the number in a smallest dominating set. (In Fig. 3-26, W2 is a smallest 
dominating set and W; is a largest independent set.) 

In a simple graph G a vertex v dominates itself and the 5(v) other 
vertices with which it is adjacent. Consequently, a set of k vertices 


S = {vi .. . ,%} dominates at most k + by 5(v;) vertices. If G has n 
bi 


vertices, it follows that a necessary condition for S to be a dominating 
set is that 


k 
k+Y5u) > 
1 


In particular, a dominating set for a simple graph which is p-regular 
{6(v;) = p for all 7] must contain at least n/(p + 1) vertices. 


Exercise 3-19 Produce several graphs for which a largest independent set coin- 
cides with a smallest dominating set. Can you think of any classes of graphs which 
have this property? 


Exercise 3-20 In the text the procedure for producing a maximal independent 
set of vertices was seen to fail (sometimes) to produce a largest independent set. Con- 
sider the following modification: (1) start with a vertex of smallest degree and (2) at 
every stage select as the next vertex to enter the set one which is adjacent with the 
fewest possible vertices not in the set (and, of course, with none already chosen). 
This procedure will correctly find a largest independent set for the graph shown in Fig. 
3-26. However, construct a graph for which it fails to yield a largest independent set. 
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Exercise 3-21 We have seen that a maximal independent set is necessarily a 
dominating set. A minimal dominating set may be independent (see, for example, 
Fig. 3-26). Give an example to show that it need not be. 


A related class of problems, and one which has been the object of 
much research, is the following: Partition the vertices of a given graph 
into the smallest number of independent sets. Problems of this type, 
frequently called coloring problems, have the alternative formulation: 
Assign to each vertex a color (or abstract label) in such a way that adja- 
cent vertices always have different colors (labels) and the number of 
different colors (labels) employed is as small as possible. 

This smallest number is called the chromatic number of the graph, in 
keeping with the above coloring formulation. The chromatic number of 
a simple graph G@ having n vertices is less than n if @ is not complete and is 
greater than 1 if G has any edges at all. The chromatic number of a given 
class of graphs is defined to be the largest chromatic number realized by 
any member of the class. Expressed differently, it is the smallest number 
of colors which suffices to color every graph in the class. This number 
can be determined for various classes. As an example, the reader may 
verify the following theorem: 

Theorem 3-24 Every tree (except the degenerate one consisting 
of a single isolated vertex) has chromatic number 2. 

It is important to recognize the close relationship between the 
chromatic number of a graph and the concept of a k-partite graph. The 
chromatic number of G is simply the smallest & such that G is k-partite. 
In particular, Theorem 3-24 asserts that every tree is bipartite. Two 
representations of the same tree are shown in Fig. 3-28. The second shows 
the bipartite property clearly, while connectivity and the absence of 
circuits are more apparent in the first representation. 

We shall return later to the topic of coloring problems and, in particu- 
lar, to the problem of determining the chromatic number of the class of 
planar graphs. This is the celebrated four-color problem (since it is con- 
jectured that the chromatic number is four). 
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Figure 3-29 


Remark: Tait conjectured that the graph of every convex polyhedron 
contains a hamiltonian circuit, and the four-color conjecture would follow 
immediately from that. It is known that a graph is a skeleton of a convex 
polyhedron if and only if it is planar and it is 3-connected. Thus Tait’s 
conjecture is equivalent to the fact that every 3-connected planar graph 
has a hamiltonian circuit. Tutte’ has proved that the planar graph of 
Fig. 3-29 has no hamiltonian circuit. 


3-6 RADIUS AND DIAMETER 


Let @ = (V,E£) be a connected graph. We can reasonably define a 
distance between vertices as follows: If v ~ w, d(v,w) denotes the minimum 
number of edges contained in any chain joining v and w, whereas d(v,v) = 0 
for all v. 
For a fixed vertex v, the integer 
R(v) = max d(v,w) 

weV 
measures the distance from v to the vertex (or vertices) most remote from 
v. It is intuitively clear that a vertex is relatively central if R(v) is rela- 
tively small. Thus it is natural to call 

Ro = min R(v) 


ve 
the radius of the graph and to refer to any vertex vp as a center if 
R(v%) = Ro 


Figure 3-30a and b shows graphs having radii 1 and 3, respectively. 
A graph may have many centers. In Fig. 3-30b, for example, every vertex 
is a center. 
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An application of another measure of centrality is found in Sec. 6-16, 
in connection with communication networks, 

The diameter T of a connected graph is the maximum distance between 
pairs of vertices. Symbolically, 

T = max d(v,w) 
v,weV 

The graphs shown in Fig. 3-30 have diameters 2 and 3, respectively. 
Note from the second example that the radius and diameter of a con- 
nected graph may be equal. 


Exercise 3-22 Let @ be the graph determined by a simple chain having 2n edges. 
Determine its radius, diameter, and number of centers. 


Exercise 3-23 Repeat Hxercise 3-22 for a simple chain having an odd number of 
edges and for simple circuits having even and odd numbers of edges. 


Exercise 3-24 Characterize simple complete graphs in terms of the concept of 
diameter. 


Exercise 3-25 (a) Show that the radius and diameter of a connected graph 
necessarily satisfy the relationship 7/2 <R<T. (b) Produce examples which 
realize each extreme. 


Exercise 3-26 Show that in a tree having at least two edges, the radius R and 
diameter T always satisfy R < T. 


Exercise 3-27 Show that d(v,w) possesses the usual properties of a metric, or 
distance function: 
dvv) = 0 
d(vjw) > 0 ifw #v 
d(v,w) = d(w,v) 
dup) + d(vw) > d(u,w) 


The concepts of radius, center, and diameter can be extended to 
directed graphs of certain types. If we redefine d(v,w) to mean the num- 
ber of ares in a shortest path from v to w, then we must either assume that, 
there exists a path from » to w, for every v and w, or else consider the radii 
and diameters of certain directed graphs to be undefined. We can avoid 


(a) (b) 
Figure 3-30 
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the latter possibility by restricting considera- 
tion to strongly connected graphs. 

If D = (V,A) isastrongly connected graph, 
we formally define the radius, diameter, and cen- 
ters precisely as in the undirected case, using the 

z w modified definition of d(v,w). In order to avoid 
Figurag=3? confusion, the radius and diameter in the di- 


rected case will be denoted by R and T. 

In Fig. 3-31 we have a directed graph D obtained from the undirected 
graph of Fig. 3-30a by a particularly unfortunate orientation of edges. 
It is easily seen that this graph is strongly connected and that 

R=2 
the vertices v and w being centers. Moreover, 


T=6 


this distance being realized by the (unique) simple path from z to v. So 
in the directed case, the radius may be less than half of the diameter, 
which ultimately stems from the fact that the distance function is no 
longer symmetric. [Note, for example, that in Fig. 3-31 we have 
d(av) = 6, while d(v,x) = 1.] 

These unusual characteristics can be avoided by restricting considera- 
tion still further to the class of directed graphs which are connected and 
symmetric. But for such graphs the directed and undirected meanings of 
radius and diameter are essentially the same. Every chain joining v and 
w in the associated undirected graph gives rise to a path from v to w (as 
well as one from w to v) having an equal number of ares. Thus the two 
meanings of d(v,w) produce numerically identical distance functions. 


3-7 MINIMAL - LENGTH PROBLEMS 


Consider a graph each of whose edges (or ares, if the graph is directed) has 
an associated real number, which we shall call its “length.” Depending 
on the application, this number may be a measure of physical distance, of 
time consumed, of cost, or of some other factor of interest. Although we 
shall use terminology appropriate for physical distances, the other possible 
interpretations should be borne in mind. The two essential characteristics 
of length, for the present discussion, are these: 


1. It must be additive in the sense that the length of a collection of 
edges or arcs may be taken to be the sum of the lengths of the individual 
edges or arcs. 
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2. It must be a measure which it is inherently desirable to either 
minimize or maximize, subject to restrictions concerning which sets of 
edges or ares are to be considered “admissible.” 


The traveling salesman problem is an illustration in which the admis- 
sible sets of edges are those that form hamiltonian circuits. In Sec. 6-31, 
spanning trees will be the admissible sets. 

The problem to be considered now is that of finding a shortest path 
between two specified vertices in a directed graph. More precisely, let 
v9 and v; be distinct vertices in a directed graph D = (V,A) and let X(a) 
denote the length of are a. Assuming that there exists at least one path 
from v» to v1, the problem is to find a path P = {ay,a2, . . . ,a,} from vp 
to v; whose length is minimal. 

Note first of all that a shortest path from vp to v; necessarily exists 
even if some or all ares have negative length, since there are only a finite 
number of paths from vp to 2; (and, by assumption, there is at least one). 
However, in a very real sense there may be no shortest distance from vp to 
v; under certain cireumstances. This is the case if there is a cycle C such 
that \(C’) < 0 and if there is a path P from vy to 1; via a vertex w incident 
with C. For then we could construct are progressions whose lengths are 
(algebraically) as small as we please by traversing P from v9 to w, then going 
around C a sufficient number of times, and finally traversing P from w 
tov. For this reason, we shall assume that 


NC) > 0 


for every cycle C. Under this assumption, while a nonsimple path P may 
be a shortest path if \(C) = 0 for each cycle in P, it is sufficient to consider 
simple paths in order to find a shortest path. 

Tt turns out that in the course of finding a shortest path from vo to v1, 
we shall as a by-product produce a shortest path from vp to every vertex 
w # vo which is “reachable” from vp via a path. So we may as well 
suppose that we wish to solve the more general problem of finding a path 
from some fixed ‘“‘reference vertex” vy to every vertex w which is reachable 
from vo. Toward this end, we present the following definitions: 

A shorlest-distance tree T relative to reference vertex vp is a directed 
tree, rooted at vo, such that the (unique) tree path from vp to each tree 
vertex w # vo is in fact a shortest path from v to w. A mazimal shortest- 
distance tree relative to vo is a shortest-distance tree which includes every 
vertex reachable from vp. We shall show that, for every directed graph D 
having a distance function \ which satisfies 


x(C) 20 


for every cycle C, and for any fixed reference vertex v, D does in fact con- 
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tain as a subgraph at least one maximal shortest-distance tree relative to 
vo. We first establish the following theorem: 

Theorem 3-25 Let T bea rooted tree ina directed graph D = (V,A) 
rooted at vp and containing all vertices reachable from vo. If v is any 
vertex of 7’, let L(v) denote the tree distance from vo to v, with L(vo) = 0. 
Then T is a shortest-distance tree, relative to vo, if and only if every 
chord (v,w) whose end points are both in 7’ satisfies 


L(w) < L(v) + A(o,w) 


Proof: If L(w) > L(v) + A(v,w) for some chord, then the tree path 
from vo to v, together with the chord (v,w), has length L(v) + d(v,w). 
Since this is smaller than L(w), the length of the tree path to w is clearly 
not minimal. 

Conversely, suppose that 7’ is not a shortest-distance tree. Let v be 
a vertex of T for which the tree path is not a shortest path and let P = ai, 
a2, . . . , d, be a shortest path progression from vp tov. Let 

Giz1 = (viVi4s) 

be the first (i.e., lowest indexed) are of P which is not an are of T. Since 
P is a shortest path progression from v to v, P; = ay, a2, . . . , a; must 
be a shortest path progression as far as v; forj = 1,2,...,m. (The 
verification of this statement is left to the reader.) But \(P:) = L(w), 
since P; and the tree path agree as far as vertex v; By assumption, 
L(vix1) > M Pigs) = L(vi) + (aiz1). Thus we have produced an are, 
namely, 41, such that its end points satisfy 


L(w) > L(v) + d(v,w) 


This completes the proof. 

Theorem 3-25 provides a theoretical basis for an iterative procedure 
which will actually yield a maximal shortest-distance tree, relative to a 
given reference vertex vo, provided that 


x(C) = 0 


for every cycle of the graph under consideration. The procedure is as 
follows: 


1. Take as 7 any tree, rooted at vo, that includes all vertices which 
are reachable from vo. 

2. In general, having obtained tree Ti, let Li(v) denote the distance 
from vo to v through 7;. If every chord (v,w) satisfies 


Li(w) < Liv) + X,w) 


then, by Theorem 3-25, 7’; is a maximal shortest-distance tree and we are 
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through. + If not; let (v*,w*) be a chord such that 
L(w*) > Li(v*) + A(v*,w*) 


Let Ti+: be the graph obtained from 7; by adding are (v*,w*) and deleting 
the are of 7; whose terminal vertex is w*. Given that 7; is a tree rooted 
at vo and that 

x(C) >= 0 


for every cycle, it follows that 7’: constructed as above is necessarily 
also a tree rooted at vp. (This assertion is left as an illuminating exercise 
for the reader.) Step 2 is now repeated, with 7 + 1 in place of 7. 


For any vertex » which is reachable from vo, the sequence of values 
Lo(v), Li(v), Lo(v), . . . isa noninereasing sequence. Moreover, at each 
stage 7, Li(v) < Li-,(v) for at least one vertex (namely, the vertex w* 
above). Finally, the values.of L;(v) are bounded below, since there do 
not exist paths from vp to » having indefinitely small lengths. The 
observations of this paragraph, taken together, imply that at some stage 
7 we must have 

Li(w) < Liv) + AQv,w) 


for every are (v,w) which is a chord relative to T;. When this occurs, 
T; is the desired maximal shortest-distance tree according to Theorem 3-25. 

Ezample: Figure 3-32 illustrates this method for finding a shortest- 
distance tree. In this example, the reference vertex isv. Part (a) shows 
the arc lengths. In part (b) the solid arcs represent an initial tree To 
rooted at vp which spans the graph. The values Lo(v) are also shown. 
The dotted arc in (b) is an “improvement” chord (since 3 > 0+ 1). 
When this are is added and the appropriate are is removed, the new 
spanning tree 7’; shown in part (c) is obtained. In a similar manner, 7’, 
and 7’; are obtained; they are shown in parts (d) and (e), respectively. 
By checking all chords relative to 7’;, we can verify that 7’; is indeed a 
shortest-distance tree. Note that there may be several “improvement” 
chords at any stage. In that case any one may be selected. Different 
choices may lead to different final trees (and possibly in a different number 
of steps) since the shortest-distance tree is not in general unique. 

By defining \(a) = 1 for all a A, we can find paths which are short- 
est in the sense that they contain the fewest possible arcs. A variety of 
techniques for finding shortest-distance trees in directed graphs have been 
developed. See Ref. 10 for a comparative discussion of several algorithms; 
the one described here is based on the “labeling” or “index-reduction” 
approach described in Refs. 5 and 6. 


Exercise 3-28 Repeat the preceding example, but select a different improve- 
ment chord to go from 7; to a different tree 7’. 
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Exercise 3-29 Apply the tree-modification method to the directed graph in Fig. 
3-83, finding shortest paths from 1 to all other possible vertices relative to the distance 
function (a) = 1 for all arcs. 


Exercise 3-30 Repeat Exercise 3-29 with the arc lengths indicated in Fig. 3-33. 


Figure 3-33 


The following example illustrates an alternative procedure for finding 
shortest paths through a directed graph by using the concept of dynamic 
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7 
Figure 3-34 


programming.' Consider the directed graph shown in Fig. 3-34. (All 
ares are assumed to be directed to the right, but for simplicity the arrows 
have been omitted.) The number associated with each arc is to be inter- 
preted as its length, and the problem is to find a shortest path from each 
vertex x (other than w) to w. 

The crucial feature of our problem, from the standpoint of applying 
dynamic programming, is this: If the sequence of vertices 


Dy, Ut ~ 6 4 Ens W 
determines a shortest path from 2; to w, then the sequence 
Ws, Dey 2 2 6 5 ny VO 


necessarily determines a shortest path from «2 tow. This enables us to 
consider the vertices in stages, considering at the ith stage those vertices 
which are i arcs removed from w. (Note that for the graph under con- 
sideration, all paths from a vertex x to w have the same number, say k, 
of vertices. Hence the designation of stages is not ambiguous.) 

With each vertex x which is one are removed from w, we associate the 
(in this case unique) are A(x) having x as initial vertex. We also asso- 
ciate with x a distance D(x) which is the length of this are. In our exam- 
ple, these distances are 2 (for the top vertex) and 1 (for the bottom vertex). 

In general, having found A(x) and D(x) for all vertices which are k 
or fewer ares distant from w, we find A(x) and D(x) for each vertex © 
which is k + 1 ares removed as follows: 


D(x) = min {A(x,y) + D(y)} 


where the minimum is taken over all ares a > (a,y) having x as initial 
vertex. (There are at most two such ares in the example.) In addition, 
A(z) is taken to be the are which minimizes the expression. It is readily 
seen, from the truncation property mentioned earlier and from the fact 
that the stage 1 distances are clearly minimal, that the stage k +1 
distances determined in this manner are minimal for every stage k + 1. 
The bold ares (again shown without arrows) and numbers in Fig. 3-35 show 
the values of A(x) and D(z) for the first three stages, 
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Exercise 3-31 Find A(x) and D(z) for the vertices in stages 4 to 6, Fig. 3-35. 
In particular, find a shortest path from v to w. 


Figure 3-35 
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PLANAR AND NONPLANAR GRAPHS, 
COLORING THEOREMS 


4-1 INTRODUCTION 


This chapter has two basic aims. The first is to define and give 
conditions under which a graph is planar, i.e., can be mapped on 
the plane. The theorem of Kuratowski(which excludes the 
oceurrence of two special graphs as subgraphs of a planar graph> 
yields one well-known characterization of planar graphs. Another 
interesting necessary condition for a graph to be planar is that it 
be isomorphic to one whose edges are straight. The second aim is 
to study chromatic graphs with the objective of giving some 
coloring theorems. These theorems take this form: Given a 
planar map, i.e., a planar graph and the regions enclosed by its 
simple circuits plus the outside region, such that every vertex is 
of degree greater than two, determine whether it can be colored 
with 7 colors in such a way that no two regions with a common 
boundary of one or more edges have the same color. Or, given n 
colors, find the conditions on a map for which n is the minimum 
coloring number. Our emphasis is on the existence rather than 
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on the actual construction of coloring schemes. The concept of dual 
graph gives another characterization of a planar graph and plays a useful 
role in studying the coloring problem. 


4-2. PLANAR GRAPHS 


Recall that a planar graph is one which is isomorphic with a geometric 
graph in a plane, i.e., it can be drawn in the plane in such a way that its 
edges intersect only at their end points. A graph can be mapped from a 
plane on a sphere by placing the sphere at some point of the plane and 
considering the point of contact as the south pole and then using the north 
pole as the center of projection of straight lines which join the north pole 
to every point of the graph in the plane. The intersection of these lines 
with the sphere yields the desired projection, known as a stereographic 
projection, which is in 1-1 correspondence with the original graph. 

This process can be reversed so that a graph on a sphere can be 
mapped into the plane, provided that the north pole is chosen to be a 
point not on the graph. (An interesting difficulty arises if the north pole 
is selected as a vertex; for then that vertex is mapped at infinity, yielding 
a needless complication.) Note that any region of a graph drawn in the 
plane can be made the outside region by means of two mappings. First 
the graph is mapped on the sphere in such a way that the south pole 
makes contact with a point inside the desired region. The map is then 
projected back on the plane by using the former north pole as the point of 
contact with the plane and the former south pole as the center of the pro- 
jection lines. 


Exercise 4-1 Indicate how a pair of projections can be applied to the graph in 
Fig. 4-1 in order to place the triangular region inside the rectangle. 


In coloring problems one is frequently given a problem on a sphere. 
The foregoing stereographic projection shows that it is adequate to study 
coloring problems in the plane if the infinite region is included in the set of 
regions to be colored. 

We now turn to the question of characterizing planar graphs and give 
basic material which leads to the proof of the theorem of Kuratowski. 
A simple graph that has the smallest number of vertices and is yet non- 
planar is the complete graph on five vertices which we call a type 1 

Kuratowski graph (see Lemma 4-4), 
shown in Fig. 4-2. It is obvious that 
if any graph contains this five-vertex 
graph (or indeed any nonplanar 
graph) as a subgraph, the graph is of 
Figure 4-1 necessity nonplanar. An example of 


Planar and nonplanar graphs, coloring theorems 67 


a nonplanar graph which does not contain this complete graph is the 
“three utility-three house” graph shown in Fig. 4-3. We shall call this 
graph a type 2 Kuratowski graph. The name “utility graph” arises from 
the problem of connecting each of n houses to each of n utilities by means 
of conduits which do not pass over one another (i.e., which determine a 
planar graph). This is not possible for n > 3, as Lemma 4-4 shows. 
These two (minimal) possibilities—the complete graph with five vertices 
and the utility graph—cover the most general situation for nonplanarity. 

Definition A vertex v is a point of articulation of a connected 
graph @ if the graph H = G — », obtained by deleting v and all edges 
incident with v, is disconnected. A graph is said to be separable if it 
has at least one articulation point. 

Lemma 4-1 A necessary and sufficient condition for a vertex v of 
a connected graph to be a point of articulation is that » lie on all the chains 
connecting some pair of vertices. 

Proof: If v is a point of articulation, then its removal from the con- 
nected graph yields at least two components, and hence any pair of 
vertices each taken from a different component must have all chains con- 
necting them pass through v. On the other hand, if every chain con- 
necting some pair of vertices passes through v, then the removal of v 
would disconnect the graph, and hence » is a point of articulation. 

Lemma 4-2 (Theorem of Menger-Dirae.'4) If Ya, Y% are two 
vertices of a graph of n > 2 vertices without a point of articulation and if 
U = {va,¥1, . « - Undo} is a simple chain connecting them, there are two 
chains U; and Us connecting v. and v which have no other vertex in 
common and along each of them from v, to % the vertices of U (if any) 
occur in increasing order of subscripts. 

Proof: The theorem is proved by induction. It is true if the chain 
consists of a pair of vertices connected with a single edge. We assume it 
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is true for any pair of vertices if the length of the chain connecting them is 
at most m and show that it holds for a chain of length m+ 1. Define 
0 = Vay Um+1 = Y and suppose that U is the chain joining vo with m1. 
Let the vertices of U have the order vo, 11, . . « , Um; %m41- By the induc- 
tion assumption v and vy» are connected by two chains U, and U2 which 
satisfy the two properties required in the theorem. Note that the por- 
tion of U itself which joins vo with v,, may be one of the two chains Ui, U2. 
Since v,, is not a point of articulation, it does not separate vp and 41 and 
hence there is a chain U* connecting vp and 41 which does not pass 
through v,. This chain has a part W which connects a last one of the 
vertices Uo, 01, - «+ 5 Ym—1 tO Um41- 

Let v* be the first vertex (other than 0,41) which is incident with at 
least one of the chains U1, Us, or U and which is encountered when U* is 
traversed from vm41 toward vp. Let W = U*(v*,vm41), that is, the sub- 
chain of U* joining »* and vn41. If v* is incident with U,, then 


U1 = Uy(vo,v*) UW 
and Us = U2 Ut m;¥mes) 


satisfy the conditions of the theorem. An analogous situation holds if »* 
is incident with Us. Finally, if v* is incident only with U, then o* = 
for some k such that 0 <k <m—1. Let vp be the vertex with largest 
subscript not exceeding k — 1 which is incident with Ui or Uz. If vp is 
incident with U;, then 


Ui Tivo») U U(lp,%) UW 
and U2 = U2U U (Um 0m+1) 


are the desired chains. (If v9 = vp, the first subchain associated with U1 
is to be ignored.) An analogous pair of chains exist if vp is incident with 
Us. This completes the proof. 

Lemma 4-3 A connected planar graph with n vertices, m edges, 
and r regions (including the outside or unbounded region) satisfies Euler’s 
formula n — m +r = 2. 

Proof: Note that n — m + r is unchanged if a vertex of the graph is 
joined by an edge with another vertex (not intersecting other edges); 
for this operation produces a new region. (The addition of an edge must 
produce a circuit, because the graph is connected.) Similarly, this quan- 
tity remains the same if a new vertex is introduced and connected to two 
vertices of the graph by means of two edges; for this also adds a region. 
The left side also remains unchanged if, instead of adding vertices and 
edges as described, one removes them according to this procedure. By 
means of these operations of addition and removal, the figure can finally 
be reduced to a triangle. This is a graph with n = 3, m = 3, andr = 2, 
since the region outside the triangle is also counted, from which the result 
follows. 


i 
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Lemma 4-4 The Kuratowski subgraphs are nonplanar. 

First proof: If the utility graph were planar, we would have, on sub- 
stituting in Euler’s formula, r = 5. Each region must have at least 
four edges for boundary; for if any region had three edges, then two of 
the three vertices would be either houses or utilities and they would be 
adjacent, which is a contradiction, Since each edge is the boundary of 
two regions and is thus counted twice, we must have 4r < 2m or m > 2r, 
thatis,9 > 10. From this absurdity we conclude that the utility graph is 
nonplanar. 

In the ease of the complete graph each region has at least three edges 
for boundary, from which 3r < 2m; and because r = 7 and m = 10, we 
have the contradiction that 21 < 20. 

Second proof: Another method of proving Lemma 4-4 makes use of the 
Jordan curve theorem. This theorem states that a simple closed curve 
(the homeomorph of a circle) divides the plane into two regions whose 
common boundary is the curve. A corollary of this theorem is that a 
simple curve joining two points in these two regions intersects the 
boundary. To prove that the utility graph is nonplanar, connect two of 
the utilities to two of the houses as in Fig. 4-4. This yields a Jordan 
curve. The third utility is either inside or outside the region enclosed by 
the curve. Suppose it is inside. (If it is outside and it is connected to 
the houses, then another utility must be inside.) We connect it to the two 
houses. Then no matter which region the third house belongs to, it will 
be separated by a Jordan curve from a utility and cannot be joined to it. 
Consequently the graph is nonplanar. 

To prove that the complete graph is nonplanar, consider four points 
joined twoatatime. This gives Fig.4-5. As can be seen from the figure, 
for v; to be joined with v3 and v2 with vs without intersection, one edge 
must be inside and the other outside the rectangle. No matter where the 
fifth vertex is placed, it would be separated by a Jordan curve from one of 
the other vertices and we again conclude that the graph is nonplanar. 

The planarity or lack of planarity of a graph is clearly not affected if 
an edge is divided into two edges by the insertion of a new vertex of 
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degree 2 or if two edges incident with a vertex of degree 2 are merged into 
one edge by removal of that vertex. This suggests the following defini- 
tion: Graphs G and G’ are isomorphic to within vertices of degree 2 if they are 
isomorphic or if they can be transformed into isomorphic graphs by 
repeated application of the above transformations, 

Theorem 4-5 (Kuratowski.) A graph is planar if and only if it 
does not contain any subgraph which is isomorphic, to within vertices of 
degree 2, to either of the Kuratowski subgraphs. 

Exercise 4-2 Show that the existence of a subgraph which is isomorphic to 
within vertices of degree 2 to a Kuratowski graph is equivalent to the existence of five 
or six points together with a set of vertex-disjoint chains (except for their end points) 
which—when the chains are regarded as single edges—constitute a Kuratowski graph. 

The proof we shall present of Theorem 4-5 is a substantial refinement 
due to Berge® of Kuratowski’s original proof.*° 

Proof: We have already demonstrated that a planar graph does not 
contain either of the Kuratowski subgraphs or a subgraph which is 
isomorphic to one of these to within vertices of degree 2. The proof in 
the other direction, i.e., that a nonplanar graph contains one or both of the 
Kuratowski subgraphs, proceeds by induction on the number of edges. 
We use the equivalent contrapositive statement that if a graph does not 
contain either of the Kuratowski subgraphs, it must be planar. It is 
obviously true for a graph with one, two, and three edges. Let the 
contrapositive be true for less than m edges. We now show by contradic- 
tion that it is true for m edges. Hence we suppose that @ has m edges, is 
nonplanar, and contains neither of the Kuratowski subgraphs but that 
every subgraph with m — 1 edges is planar. The contradiction will be 
that G contains a Kuratowski subgraph. 


1. G must be connected; for otherwise all its components have fewer 
than m edges and are planar, and hence G would be planar. 

2. The graph G is inarticulate. Note that a point of articulation 
may be mapped into the boundary of the infinite region by placing a sphere 
in a region having this point on its boundary and inverting the graph with 
respect to the sphere. The removal of this point would leave components 
planar, because they have fewer than m edges, and thus its addition would 
yield a planar graph. 

3. Weshow next the existence of an elementary circuit on suppressing 
an edge, i.e., the removal of an edge [a,b], leaves a circuit S passing through 
aandb. To see this, observe that the subgraph G’ = G — [a,b] must be 
either (1) articulate or (2) inarticulate. In case 1 there is a point of 
articulation c through which passes every chain between a and 6 in G’. 
However, this leads to a contradiction, for the removal of ¢ leaves two 
components C, and Cy. Let Ci and C; be obtained from C, and C, by 
adding the edges [a,c] and [b,c], respectively. By the original statement 
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(not the contrapositive), neither C, nor C; contains a Kuratowski sub- 
graph because G does not. By stereographic projection [a,c] and [b,c] can 
be made into edges of the infinite region. In this region, if @ and b are 
joined, the result is still a planar graph, since the edge is completely 
within the one outside region. This graph is planar and contains G. 
Since this is a contradiction to the assumption that G is nonplanar, we 
conclude that G’ cannot be articulate. 

Thus, case 2 holds and @ is inarticulate and, by Lemma 4-2, there are 
two chains joining a and b and forming a circuit, i.e., they have no point 
in common; therefore, @’ has a circuit. This circuit will now be used to 
contradict the nonplanarity assumption on G. 

4. Let S be a simple circuit passing through a and 6 in G’ containing 
in its interior the maximum number of regions. Let us introduce an 
arbitrary orientation on S. The parts of G’ inside S are called the interior 
graph, and those outside are called the exterior graph. 


The exterior graph cannot contain more than one vertex of the half 
circuit S{a,b]; for otherwise one can construct a circuit passing through a 
and b which contains in its interior a larger number of regions. This is 
also true of the half circuit S[b,a]. The exterior graph, however, must 
have one or two points in common with S; for if it had none, the graph @ 
would be disconnected. On the other hand, there exist at least one con- 
nected subgraph I of the interior graph and one connected subgraph E of 
the exterior graph each of which meets both S]a,b[ and S]b,a[, where S]a,b[ 
denotes the graph obtained from S{a,b] by deleting the vertices a and b. 
To see this, note that @’ is planar and introducing [a,b] into it must make it 
a nonplanar graph. But for G to be nonplanar, [a,b] must either intersect 
the subgraph consisting of S]a,b[ and S]b,a[ plus the interior graph or 
intersect the subgraph consisting of S]a,b[ and S]b,a[ plus the exterior 
graph. 

Furthermore, S must contain four vertices which are alternately 
points of contact with J and with 2. To prove this, suppose the contrary 
and let e; and f; be two consecutive points of contact of a connected sub- 
graph J, of the interior graph which meets S]a,b[ and S]b,a[. These two 
points can be joined by a path P lying outside S and not meeting any of 
the existing edges, since by hypothesis there exists no connected subgraph 
E of the exterior graph which meets SJa,b{ and S]b,a[ and which meets 
Sle:,fi{ and S]f,,e:[. Every connected subgraph of the interior graph, 
including J;, which meets only S{fi,e;] can be transferred to the exterior 
inside the region limited by P and S. This leaves at least one connected 
subgraph J» of the interior graph which meets S]a,b[ and S]b,a[ and has 
two points of contact e2 and f2 on S{e:,fi), at least one of which is on S]e:, fil. 
We can now replace J, by J: and continue the transfer procedure. Since 
the graph is finite, the entire interior graph can be moved to the exterior, 
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Figure 4-6 


which is a contradiction; i.e., the graph must then be planar, since nothing 
intersects [a,b]. Thus there exist connected subgraphs J and E of the 
interior and exterior graphs, respectively, which meet S]a,b[ and S]b,a{ and 
which intersect S at alternate points c, d (of Z) and e, f (of I). 

The case when both the exterior and interior components have com- 
mon points of contact at S or coincide with a or b is remedied by intro- 
ducing additional points to satisfy the requirement of alternation. All 
the possibilities can be covered by taking four points for the interior com- 
ponent. We suppose that e, f, g, h are the points of contact of J with S 
such that ee Sle,d[; feSld,cl; geSlabl; he S]b,al. (See Fig. 4-6.) 
Obviously e can never be f nor can g be h, but we can have e = g and 
e = h, ete., and so we consider all these cases: 


1. If ee S]b,a{ and fe Sla,b[, let e = hand f =g. This leads to the 
graph of type 2, which is contrary to hypothesis. 

2. If both e and f are on Sja,b[, let h = ¢; for if h # c, he S]b,a{ and 
we are in case 1. But if h = c, we again get a graph of type 2. 

3. Ife = aand/f = b, for example, f ¢ S]a,b[, we obtain a type 2 graph. 

4. Ife = aand f = b, we can assume g = d and h = ¢; for if not, we 
are in either case 1 or case 3. There are two cases to be considered: 

(a) If the chains of J joining cd and ef have more than one vertex in 
common, we get a type 2 graph. 

(b) If the chains of J joining cd and ef have only one vertex in com- 
mon, we get a type 1 graph. 


As indicated in Fig. 4-6, case 4b leads to a type 1 graph and the remain- 
ing cases to a type 2 graph and thus, in any case, to a nonplanar graph, a 
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contradiction, This completes the proof. (Note in Fig. 4-6 that circles 
and squares enclose those vertices which determine the Kuratowski sub- 
graphs. Vertices b and d in Tig. 4-6(2) and (3), respectively, are to be 
regarded as vertices of degree 2.) 

In addition to the theorem of Kuratowski there are other theorems 
which characterize a planar graph. These are stated without proof. 

Theorem 4-6 (MacLane.) A necessary and sufficient condition 
for a graph to be planar is that it have a circuit basis (see matrix represen- 
tation of graphs) together with one additional circuit such that this 
collection of circuits contains each edge of the graph exactly twice. 

Theorem 4-7 (Whitney.) A necessary and sufficient condition 
for a graph to be planar is that it have a dual (see below). 


The Dual Graph 


Consider a planar graph G with regions R; (¢ = 1,...,n). Letus 
associate a pointp; with each region FR; (that is, take a point in the region). 
If two regions 2; and R; are adjacent, join p; and p,; by edges pip; which 
cross the common boundary edges of R; and R; only once and have no 
point in common with any other boundary of the graph. An edge of G 
which is not the boundary of a region lies in a region and yields a loop 
(see Fig. 4-7). This procedure yields a new graph G with vertices 
Py +++ pn It is called a dual graph of G. 

We have illustrated the idea of the dual graph with dashed lines in 
Fig. 4-7. It is clear from this construction that all graphs dual to G are 
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topologically equivalent; hence one speaks of the dual graph. (Note 
that G ¥ G if @ is not connected, where G is the dual of G.) 


Exercise 4-3 Start with the dual graph given in Fig. 4-7 and construct its dual. 
Show that the dual is equivalent to the original graph. 


Exercise 4-4 Show that there is no map of five regions in the plane such that 
every pair of regions is adjacent. 


Definition A graph is trivalent if it is regular of degree 3. 

Theorem 4-8 All regions of the dual of a planar trivalent con- 
nected map are three-sided and their number is even. 

Proof: It is easy to see that corresponding to each vertex of the graph 
there is a triangle of the dual. Thus we have for the dual 3r = 2m, which, 
when substituted in Euler’s formula, gives r = 2(nm — 2). 

Remark: One can associate a vertex with each edge of a graph. If 
two vertices are joined by an edge whenever the corresponding edges are 
adjacent, the resulting graph is known as a line graph. 

Theorem 4-9 (Krausz.2*) A necessary and sufficient condition 
for a graph to be a line graph is the existence of a partition of its edges into 
complete subgraphs such that no vertex of the graph belongs to more than 
two subgraphs. 


Exercise 4-5 Verify that, for the line graph of the complete graph on n vertices. 

(a) Each vertex has degree 2(n — 2). 

(b) Exactly four vertices are adjacent to both of any two nonadjacent vertices. 

(c) Any two adjacent vertices are mutually adjacent to exactly n — 2 vertices. 

These three necessary conditions are also sufficient (except for n = 8) for a graph 
to be the line graph of a complete graph on n vertices. 


Exercise 4-6 Show that exactly 4n — 12 vertices are adjacent to either one of 
any pair of adjacent vertices of the line graph in Exercise 4-5. 


We shall now show that a necessary condition for a simple graph to 
be planar is that it can be drawn in a plane with straight lines. (Any 
simple graph whether planar or not can be drawn with straight lines in 
three-space.) 

Definition A straight graph is a planar graph in which every 
edge is a straight-line segment. 

Definition If every region of a planar graph is bounded by a cir- 
cuit of three edges, the graph is said to be triangulated. Fary*' has 
proved the following theorem: 

Theorem 4-10 Every simple planar graph is isomorphic to a 
straight graph. 

We shall prove this theorem in the sequence of Lemmas 4-11 to 4-15. 

Lemma 4-11 If a graph G is isomorphic to a straight graph, then 
each of its subgraphs is also isomorphic to a straight graph. 
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Proof: If G is isomorphic to a straight graph G, then since a subgraph 
of G is obtained by deleting the vertices and edges not in the subgraph, the 
corresponding operation in G leaves a straight subgraph isomorphic to 
the desired subgraph in G. (Here G does not represent the dual of G.) 

Lemma 4-12 Eyery simple planar graph G is a subgraph of a tri- 
angulated graph with the same number of vertices. 

Proof: Suppose G consists of at least four vertices. (The case of 
three or fewer vertices is left as an exercise.) We construct a triangulated 
graph G’ which contains G asa subgraph. If R is a region of G and v, and 
vg are two vertices of R without a connecting edge, we connect v; and v2 
with an edge lying entirely in R. This yields a new graph which is again 
simple. This operation of selecting unconnected pairs of vertices on the 
boundary of a region and connecting them by an edge in the region is con- 
tinued until every pair of vertices on the boundary of the same region are 
connected by anedge. Since each time this is done the number of vertices 
remains the same, the process terminates, yielding a simple graph G’. 

Clearly, G’ is connected ; otherwise, there is a region R adjacent to two 
components of G and we choose v; on the boundary of one of these com- 
ponents and v2 on the boundary of the other and join them by an edge in R. 
This contradicts the construction of @’. To show that G’ is triangulated, 
note first that if some region is bounded by only one edge, then the graph 
consists of this edge and its two vertices. A region cannot be bounded by 
two edges; for then the graph would contain only three vertices. Both 
cases contradict our assumption of at least four vertices. Thus the 
boundary of each region R’ consists of at least three edges, e1, €2, e3. If 
they do not form a cireuit, then there are at least four vertices 11, va, v3, v4 
on the boundary of R’. Now G’ contains the edges connecting these 
vertices, that is, v2, Vs, V4, Vs, Vea, Vgv4 (See Fig. 4-8) which divide 
the plane into four regions on the boundary of each of which there 
are only three vertices. But this is a further partitioning of R’, which is 
itself a region in G’, and we have a contradiction. Thus é1, ¢, es form a 
circuit and every region of G’ is bounded by a circuit of three edges. 

The next two lemmas are needed for the proof of Lemma 4-15. 


we 
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Lemma 4-13 Let G be a simple triangulated graph with at least 
four vertices. Ifvvy, vv, . . . , vv, are edges starting from v in their cyclic 
(e.g., clockwise) order, then the edges C, = (vivo,vovs, . . . vets) belong 
to G and form a circuit which separates » from every other vertex of G. 

Proof: Tt suffices to prove the lemma with the assumption that v does 
not lie on the boundary of the infinite region (containing the point at 
infinity); for by means of spherical projection v can be made to lie on the 
boundary of an interior region. Since the graph is triangulated, v must 
lie inside some circuits and hence there is an innermost circuit C) with 
vertices vj, 0}, . . . , 0}, Which contains v in its interior. Every consecu- 
tive pair of vertices v/, vj,, lies on the boundary of a region which is tri- 
angulated and hence contains a third vertex v’. We assert that v’ = v; 
for otherwise the circuit (v{v4,0sv4) . «000g « « » U_d4) is a circuit 
having » in its interior and is inside C’, a contradiction. Hence the tri- 
angular regions adjacent to C’; have the common vertex v; and since G is 
simple, C, must coincide with C, and the proof is complete. 

If v in Lemma 4-13 (assumed to lie on an interior region R) and its 
connecting edges are deleted, leaving an empty interior for C,, and if v; is 
jointed to vs, . . . , %—1 by edges not crossing each other but lying inside 
C,, perhaps resulting in a not necessarily simple graph G, then we have the 
following lemma: 

Lemma 4-14 If the graph G is not simple, then G has a circuit of 
three edges which separates two vertices. 

Proof: G can be nonsimple only if »; is connected to some vertex 
v; (8 <i < k — 1) by a new edge and by an edge outside C,. The circuit 
(v 1v,v0,,01) in G with vv, outside C, separates v2 and »,. This proves the 
lemma. 

Lemma 4-15 Every triangulated simple graph is isomorphic to a 
straight graph. 

Proof: The lemma is obviously true for graphs with three vertices. 
Suppose it is true for graphs with n vertices and let @ be a graph with 
n+ 1 vertices. Let v be a vertex of G on the boundary of an interior 
region R and construct G as above. The infinite region has the same 
boundary in G as in G. 

If G is simple, then by the inductive assumption there is a straight 
graph S isomorphic to G. We must now make sure that 3, the vertex 
corresponding to », can be chosen in such a way that it can be connected 
with straight segments to pertinent vertices and without intersecting 
other segments. Consider the circuit C, = (Oi02, . . . ,d:01) of § corre- 
sponding to C, in Lemma 4-13, where 9; in S§ corresponds to »; in G. The 
straight lines which include 3,0;,1 (2 < i < k — 1) do not pass through 0; 
for then 5,5; and #,0;,,; would have a common segment. Thus @; belongs 
to one of the two half planes defined by the straight line 3,5;,1 (2 <7 < 
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k — 1) and the intersection of these half planes is a convex region with 
interior points. Consider the common part K of this region and the 
interior of C,. The segment joining an arbitrary point of K with a 
boundary vertex of C, lies inside C,, except for its end point. If the seg- 


ments 5,05, 0:4, . . . , 10,1 were deleted from 8, the interior of C, would 
be empty. By selecting a point 3 inside K and drawing the segments 
5D, 02, . . . , 00%, we obtain a graph isomorphic to G. 


If G is not simple, then by Lemma 4-14 there is in @ a circuit A of 
three edges which separates two vertices. The circuit A, with the outside 
edges, forms a triangulated graph G;. It also forms such a graph G_ with 
the inside edges. Both graphs have at most m vertices and hence, by 
the inductive assumption, are isomorphic to straight graphs 8, and Ss, 
respectively. In S, the interior of the circuit A; corresponding to A is 
empty. On the other hand, S, lies inside As, the circuit corresponding to 
A in Ss. By an appropriately chosen affine transformation, A, can be 
mapped into A; so that adjacent regions in G are mapped into adjacent 
regions. This yields a straight graph isomorphic to G and completes the 
proof of the theorem. 


Polyhedral Graphs 


The reader unfamiliar with the terminology of the next two para- 
graphs may go on without loss of continuity. 

A graph is called polyhedral if its vertices and edges may be identified 
with the vertices and edges of a convex polyhedron in some euclidean 
space. The polyhedron is called the realization or embedding of its 
graph. We shall say that a graph is n-polyhedral if it corresponds to an 
n-dimensional polyhedron. Thus a graph is 1-polyhedral if and only 
if it is the complete graph on two vertices; 2-polyhedral if and only if it 
is a circuit; 3-polyhedral if and only if it is a 3-connected graph (not so 
obvious). It is also known that every k-polyhedral graph is k-connected 
and it contains a subgraph which is isomorphic to within vertices of degree 
2 with the complete graph on k + 1 vertices (i.e., one which is obtained 
by inserting additional vertices in the edges of this complete graph). Any 
3-connected subgraph of a 3-polyhedral graph is again 3-polyhedral. 
This need not be the case for k > 4. The foregoing conditions are not 
sufficient for polyhedrality, at least fork = 4.25 The complete graph on 
n vertices is (n — 1)-polyhedral, the polyhedron being called a simplex. 

An astonishing fact is that every complete graph on n vertices with 
n > 5 can be shown to be 4-polyhedral. Open questions are whether for 
every graph the set of integers k for which the graph is k-polyhedral is 
consecutive or whether one can always join any two realizations in a given 
dimension by a continuous family of realizations. New polyhedral graphs 
may be generated from their corresponding polyhedra by (1) taking the 
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dual polyhedron in which the roles of the faces are interchanged, i.e., with 
a k-dimensional face of the polyhedron is associated an (n — 1)-dimen- 
sional face of its dual—corresponding incidences are also preserved, 
(2) cutting a k-dimensional face of an n-polyhedron by a new (n — 1)- 
dimensional face close to it, (3) joining two n polyhedra along a common 
face, (4) taking the convex hull of two polyhedra of possibly different 
dimensions contained in skew linear spaces, e.g., the formation of pyra- 
mids, (5) taking the direct product of two polyhedra of possibly different 
dimensions n and m, for example, formation of prisms. The resulting 
polyhedron is (n + m)-dimensional. 


4-3 THE COMPLEMENTARY GRAPH 


Consider a simple graph with n vertices. We obtain the complementary 
graph by deleting from the complete (simple) graph with n vertices all 
those edges that occur in the original graph. A graph is self-comple- 
mentary if it is isomorphic with its complement. Thus the graph of 
Fig. 4-9 is self-complementary.*® 

Exercise 4-7 Solve the following problem by means of graph theory. Tom, 
Dick, and Harry are married to Jane, Mary, and Susan, but not necessarily in that 
order. Each couple has one child. The children’s names are Emily, Alan, and 
Michael. Group together the names in each family by using the following information: 
(1) Mary’s child and Harry’s child are the two stars of the school football team. 
(2) Tom’s son is not Alan. (3) Dick’s wife is not Susan. 


Hint: There are nine vertices in this graph in groups of three. Each member of 
a group is connected to exactly one member of the others. Consider the comple- 
mentary graph, ignoring edges joining two vertices in the same group. 


Before we give an interesting theorem on complementary graphs, 
we consider the Euler relation 


n-m+r=2 
and the fact that 


3n < 2m n > 4 (a useful relation, not needed here) 
38r < 2m r>4 


always hold. If we use the second inequality in Euler’s formula, we have 


n-it+r22 
or r<s2n-—4 


By substituting in Euler’s relation and simplifying, we 
obtain 
m<3n-6 


Figure 4-9 which holds for any planar graph. Consequently, a graph 
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is nonplanar if 
m > 3n—6 


Theorem 4-16 IfG isa graph of n vertices and G’ is its complement, 
then: (1) if n < 8 at least one of the two is planar, (2) if nm > 8 at least 
one of the two is nonplanar, and (3) if n = 8 one or the other or both may 
be planar or nonplanar. 

Proof: For case n > 11, note that m+ m = n(n — 1)/2 where m 
is the number of edges of G’. If m > m, thenm > n(n — 1)/4, otherwise 
m > n(n — 1)/4. In either case, forn = 11 we have n(n — 1)/4 = 27.5 
and 3n — 6 = 27, from which either m or m is greater than 38n — 6 and 
the result follows. For n > 11 a similar argument applies. 

The argument for cases 9 and 10 is elaborate and will not be given 
here. It can be shown by exhaustion*®* that for n = 9 every planar 
graph with 3n — 6 edges has a complement containing one of the two 
Kuratowski subgraphs. From that it also follows that a graph with 
n = 10 vertices and 3n — 6 edges has a nonplanar complement, since 
if we suppress a vertex with its connecting edges we obtain a planar 
graph on 9 vertices where, by the foregoing, the complement is nonplanar 
and is contained in the complement of the graph with n = 10 vertices 
under consideration. 

For n = 8, first consider a graph with four utilities and four houses. 
This graph is nonplanar and its complement is obviously planar. Again 
consider the Kuratowski utility subgraph with two isolated vertices. It 
has 9 edges and its complement has 19 edges, which exceeds 18, the maxi- 
mum number of edges possible in a planar graph of 8 vertices. Finally 
consider two concentric squares. Label the vertices of the inner square 
1, 2, 3, 4 and those of the outer square 5, 6, 7, 8 where 5 is near 1, 6 near 
2, 7 near 3, and 8 near 4, The following pairs give vertices to be joined 
with straight edges (1,2), (1,4), (1,5), (1,6), (2,3), (2,4), (2,6), (3,4), (3,6), 
(3,7), (3,8), (4,5), (4,8), (5,6), (5,8), (6,7), (7,8), and join with an exterior 
are 6 to 8. This isa planar graph. Its complement can also be shown 
to be planar. 

The cases n <8 are left as exercises to the reader. Note for 
example that by isolating a vertex in a graph, the complement would 
have a greater number of edges; this is helpful to remember for cases 
n<8. 

Theorem 4-17 If k and k’ are the chromatic numbers of a graph 
G with n vertices and its complementary graph G’, respectively, then** 


2Vn<skth<sntl 


nsw <(2$2)" 
Shi S\—9 
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Proof: Let n; vertices of G be colored with the ith color; then 
k 
u=n and max n; >” 
im id k 
Two vertices in G with the same color are connected by an edge in G’ 
and hence are colored differently. Thus k’ > max n; > n/k, or kk’ > n. 
Now from 
(k — k’)? >0 
we have 
(k +k’)? > 4kk’ 
This implies that 
k+h' >2V/n 


We now show by induction on n that k + k’ <n+1. The theorem 
is true forn = 1. Assume it is true for n vertices and adjoin a vertex v 
to the complete graph consisting of G and G’ taken together. Since v is 
connected to n other vertices, let m of them be adjoined to G to yield a 
graph H (with chromatic number h) and n — m to @’ to yield a graph H’ 
(with chromatic number h’). Clearly, we may have to use at most one 
more color in H and H' than we used in G and G’. Thus h <k+1, 
hs’ +1, andh+h’<n+3. If in facth=kh+1,h' =h +1, 
then if v and its edges in H are deleted, the chromatic number of H will 
be reduced. In thiscasem > kandn — m > k’, from whichk + k’ <n 
and again h + h’ < n + 2, completing the inductive proof. 

The theorem is demonstrated if we finally note that from 


(eb +k’)? > 4kk! 
and from (k + k’) <n + 1 we have 


r n+ 1\3 
ws(*$) 


4-4 COLORING THE EDGES OF A GRAPH 


Consider the complete graph on n vertices (which is clearly nonplanar 
for n > 5) and suppose that some edges of this graph are colored red and 
the remaining edges blue. What is the smallest number of triangles 
whose three sides have the same color (monochromatic triangles)? This 
problem was first solved by Goodman,** but here we give a simpler proof 
due to Sauve.*? 

Theorem 4-18 Ina complete graph on n vertices let b be the num- 
ber of triangles with three blue sides and let r be the corresponding num- 
ber of red-sided triangles; then the number of monochromatic triangles 
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(b + 7) satisfies 


s(s — 1)(s — 2) ifn = 28 
3 
b+r> ele — es FD) ifn =4s8+1 
28(s + 1)(4s — 1) inode 
3 


where s is a nonnegative integer. The bound is sharp; i.e., equality is 
attained for each n and some (red-blue) coloring. 

Proof: For each pair of edges meeting at a vertex, assign the weight 2 
if they have the same color and —1 if they do not, The weight at a 
vertex is the sum of the weights of pairs of edges meeting at the vertex. 
Also, the weight of a triangle is the sum of the weights of pairs of its 
edges meeting at each of its vertices. In general, each pair of edges 
belongs to a single triangle, and hence the sum w of the weights of all the 
triangles is the sum of the weights of all vertices. The weight of a mono- 
chromatic triangle is six, whereas all other triangles have weight zero, 
from which b + r = w/6. The problem now is to determine the smallest 
value of w/6 for all possible colorings. Ifn = 2s, then the weight at each 
vertex is a minimum when the maximum number of pairs of edges at the 
vertex have different colors. Now each vertex has 2s — 1 edges. If 
there are ¢ edges of one color meeting at a vertex v, then there will be 
2s — 1 — t edges of the other color also meeting at v and the weight at v is 
given by 

20% + 207-7 — ts — 1 — t) 


where C's is the number of combinations of ¢ vertices taken two at a time, 


ie, 
t! 


i 
C= Gopi 
eS eat}! 
Similarly, C+ = on 


The reader will readily verify that the above weight is minimum for 
t= s. Thus the total weight at all vertices satisfies 
w > 2s(s — 1)(s — 2) 
and Sy ne 
6 3 
A similar analysis produces the result for n = 4s + 1 with 2s edges of 
each color at each point. If n = 4s+ 3, then 2s +1 cannot be the 
number of edges of each color at each point, since the total (4s + 3) 
(2s + 1) is not an integer. Hence 4s + 2 of the points have 2s + 1 
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edges of each color and the remaining point has 2s edges of one color 
and 2s + 2 of the other. 

To prove equality when n = 2s, we follow the argument by Lorden.** 
Label the vertices v1, . . . , ¥%» and color with red any edge-connecting 
pairs of vertices the sum of whose indices is even and color with blue 
any edge of the opposite case. Vertices that have even indices are 
connected with red edges, the totality of which is called a red net. 
Those with odd indices (which exceed the even ones by unity if n is odd) 
are also connected pairwise with red lines (also forming a red net). Every 
triangle must have at least one edge in one of the two red nets, and hence 
there is no triangle that has all three edges colored blue. If n = 2s, 


each of the two red nets has s vertices, and hence (3) red triangles, and 


! 3 

s s+1 A b 

are {4 + 3 red triangles. Reference 40 proves equality for odd n. 
Theorem 4-19 In any complete graph on n vertices 


b+r (3) ES ie 1-2) 


where J; is the number of vertices to which the ith vertex is red-con- 
nected.** 

Proof: Incident with the vertex v; there are J; red edges and n — 1 
— I; blue edges, and hence there are J;(n — 1 — J;) pairs of edges of 
mixed color, each pair forming two sides of a triangle of mixed-color 
edges. Since the third edge is either red or blue, one of the two remain- 
ing vertices will have a red-blue pair. Hence the number of mixed color 


triangles is 


there are 2 (3) red triangles in the graph. Show that ifn = 2s + 1, there 


12 

3 3 In -1—-1,) 
i=1 

The total number of triangles (3) 

triangles b + r plus the number of mixed-color triangles, and the result 

follows. Similar questions can be asked** for the coloring of a graph 

using several colors, but there are no analogously simple theorems. 


is equal to the number of single-color 


4-5 COLORING REGIONS AND VERTICES: 
THE FOUR - COLOR PROBLEM 


The four-color problem is to prove or disprove that four colors are suffi- 
cient to color any planar map so that no two adjacent regions have the 
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Figure 4-10 Figure 4-11 


same color. (Meeting at a vertex is not considered as adjacency.) Fig- 
ure 4-10 shows that four colors are necessary. The conjecture was first 
presented in lectures of Mobius in 1840 and was made better known by 
De Morgan, who received it through Franci Guthrie around 1850. 
Cayley in 1878 observed that he had been unable to obtain a rigorous 
proof. In 1890 Heawood revised a wrong proof of the problem due to 
Kempe (1879) and was able to prove the sufficiency of five colors. An 
excellent account of earlier ideas on the problem is given in Ref. 2. 


Exercise 4-8 Four-color the regions of the map given in Fig. 4-11. 


Remark: If the degree of each vertex of a graph is at most k, it is intui- 
tively obvious that the graph can be (k + 1)-colored, because no vertex 
is joined to more than k other vertices. This suggests the question 
whether a smaller number of colors would suffice. The complete graph 
on k + 1 vertices has degree k at each vertex but requires k + 1 colors. 

It turns out that for all graphs of maximum degree k, except the 
graphs having one component which is a complete subgraph, fewer colors 
are sufficient. Thisis the theorem of Brooks” applied to a not necessarily 
planar graph, which we give here without proof. 

Theorem 4-20 Let G be a linear graph with n vertices and without 
loops such that no more than k edges (where k > 2) are incident with each 
vertex. Suppose that no connected component of @ is a complete k + 1 
graph. Then it is possible to color the vertices of G with k colors such 
that no two vertices have the same color. 


The Number of Regional Adjacencies 


Ina planar map, let us consider the construction of regions such that 
each region is adjacent to as many other regions as possible. We start 
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with two adjacent regions A and B. 
To construct the region C, we con- 


nect a point from the boundary of A 

\ + \ with another from the boundary of 

B by an are. To construct D, we 

connect the boundary of C with the 

boundary of A such that B would be 

adjacent to D, ete. By carrying out 

this operation n times, we obtain 

n + 3 regions, because we include the 

outside region also. Each time we draw an are we obtain three ares, the 

new one and a division of the two arc segments it touches into four are 
segments (Fig. 4-11la). 

Thus the total number of are segments is 3n + 3, including the initial 
three ares between A and B, Since each are adjoins two regions, we have 
a total of 6n + 6 adjacencies of regions. The average number of regions 
adjacent to any region is (6n + 6)/(n + 3) = 6 — 12/(n+ 3). Note 
that we could have started the process by constructing A and B and then 
carrying out m — 2 other operations. Our number for » + 1 regions 
(including the outside region) would be 6 — 12/(n + 1). Thus in any 
case for a general map, the average number of regions adjacent to any 
region is less than 6. Note that this is an average and can, of course, be 
exceeded for individual regions but compensated for by a small number of 
adjacencies for other regions. 

Another proof that the average of the number of edges bounding the 
faces of a map is less than six uses Euler’s formula excluding the outside 
region, that is, n — m +r = 1, and the relation 3n < 2m, which when 
combined give m < 8r — 3. If the regions are numberedi = 1,... ,7 
and if the number of edges of the ith region is e;, then 


Figure 4-lla 


r 
Y « <2m <6r—6 


i=1 
since some but not all edges are on two faces. Hence, 
as) e¢ <6—- g< 6 
T ir 
It follows that there is at least one vertex having degree 5 or smaller. 
Yet another proof of this fact appears in the proof of Theorem 4-21. 
Sufficiency of Five Colors and Other Theorems 


This section is concerned with theorems illustrating in part how far 
the four-color conjecture has gone toward a proof. 
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Theorem 4-21 Five colors are sufficient for coloring the regions of 
a planar map. 

Proof: We prove the theorem by induction on the dual; i.e., we color 
the vertices. Assume it is true for n — 1 vertices. The graph has at 
least one vertex v whose degree <5. If this were not true, then by using 
the relationship between the faces and edges 3r < 2m and 6n < 2m (that 
is, we assume the contrary that every vertex has at least degree 6) and 
substituting in Euler’s formula, we obtain 0 = 2m/6 — m + 2m/3 > 2,a 
contradiction. Now if we remove v from the graph, the resulting graph 
can be colored with five colors by our induction hypothesis. Let us now 
consider the worst possibility, i.e., that five vertices, . . . , vs are adja- 
cent to v and arranged in clockwise direction. Suppose that the coloring 
of the graph without v (when v is removed, the edges incident with it are 
also removed) assigns a different color to each of these vertices (otherwise 
one of the remaining colors could be assigned to v and the theorem is 
proved). Let the respective colors bec, . . . , ¢s. We now show that 
it is possible to reallocate the colors so that v will receive a color different 
from the vertices with which it is incident, i.e., at least two of these 
vertices will be assigned the same color. Consider the subgraph of ver- 
tices colored with c; and c; (the same colors as those of »; and v3). If v1 
and v3; are not connected (i.e., there is no path between them) in this 
subgraph, then the vertices colored with c, of the component which con- 
tains v; can be colored with cs and those colored with c; in that component, 
are now colored with c;. In this manner both 2 and »v; receive the color 
cs; and » can be colored with c. If, on the other hand, »; and »; are con- 
nected in the subgraph of vertices colored with ¢; and c; then, for example, 
vg and v4 cannot be connected in the subgraph of vertices colored with cz 
and c;. Otherwise, the path connecting them must meet the path con- 
necting v; and v; and the vertex where the two paths meet will have 
assigned to it one color from one subgraph and a second color from the 
second subgraph. This subgraph which includes v, and v4 can be recolored 
in the same manner as the disconnected case for v; and v3 above, and hence 
ve and v4 receive the same color. The other color is then assigned to v. 

Theorem 4-22 Two colors are sufficient for coloring the regions 
generated by the intersection of straight lines in the plane. 

Proof: The proof is inductive. A plane is divided by a straight line 
into two regions, and hence two colors are sufficient. When a second line 
is drawn, the new map is recolored by reversing the colors on one side of 
the new line. Suppose the theorem is true for n — 1 lines. When the 
nth line is drawn, if the colors of all regions on one side of the line are 
reversed, we obtain a total coloring with two colors. 

Theorem 4-23 A necessary and sufficient condition for a map to 
be properly colorable (i.e., regions with a common edge colored differently) 
with two colors is that every vertex have even degree >2. 
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Proof: The faces of a planar graph can be properly 2-colored if and 
only if the vertices of its dual can be properly 2-colored, i.e., if and only 
if the dual is bipartite. Using the fact that a tree is bipartite, and con- 
sidering its chords, one can show that a graph is bipartite if and only if 
every circuit has an even number of edges. Further, for planar graphs, 
every circuit will have an even number of edges. Still further, for planar 
graphs, every circuit will have an even number of edges if and only if 
every circuit which bounds a face has this property. (The reader should 
verify this step in detail.) But this is equivalent to the assertion that 
every vertex of the original graph has even degree. 


Exercise 4-9 Prove that the map obtained by drawing any finite number of 
circles in the plane can be properly colored with two colors. 


Exercise 4-10 Use the fact that if every vertex has degree at least 3, then at 
least one region has at most five sides, to develop a short inductive proof of the suffi- 
ciency of six colors for the proper coloring of any planar map. 


Exercise 4-11 Show that if every vertex of a map has degree at least 3, then the 
number of edges and regions satisfy 3r > 6 + m. 


Exercise 4-12 Use the result of Exercise 4-11 to show that if every vertex has 
degree at least 3, and if there are fewer than 12 regions, then at least one region is 
bounded by 4 or fewer sides, 


Exercise 4-13 Prove that four colors are sufficient to color every map which has 
fewer than 12 regions and for which every vertex is of degree at least 3. 


The flow ratio of a circuit in a directed graph is the ratio of the 
number of ares directed in one direction to the number of ares directed 
in the opposite direction, where the denominator must not be more than 
the numerator.** Note that this ratio may be infinite. 

Theorem 4-24 A necessary and sufficient condition that the 
vertices of a graph can be k-colored is that there exist an orientation of 
the edges of the graph such that the flow ratio of each cireuit does not 
exceed k — 1. 

Proof: To show necessity, assume that the graph is k-colored and the 
colors are labeled 0,1, ...,%—1. Let each are be directed from the 
vertex with the smaller color label to that with the larger color label. 
Then each circuit has flow ratio <t — 1. To see this, note that the 
largest flow ratio is obtained by orienting as many ares as possible in one 
direction and as small a number as possible in the opposite direction. 
This ratio is obtained, for example, by successively coloring the vertices 
in increasing order and then starting again when all colors are used. In 
this manner for every set of k — 1 arcs oriented in one direction there is 
one are oriented in the opposite direction. 

To prove sufficiency, suppose the graph is connected and let the flow 
ratio be <k — 1 for each circuit. Select a starting vertex vp and color it 
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with the color 0 and consider some other vertex vp. We shall define an 
auxiliary integer-valued function g(v,) whose values when reduced mod k 
yield the desired coloring. As a preliminary operational step for defining 
g(vp), we define the gain of a chain of arcs, when traversed from vo to vp, 
to be the number of ares traversed in their inherent direction minus 
k — 1 times the number of arcs traversed in the opposite sense. 

To show that such a chain with maximum gain exists, note first that 
if a chain is not simple and hence contains a circuit, the circuit cannot 
contribute to the gain because of the flow ratio condition and hence it 
may be deleted from the chain to yield a chain with equal or greater gain. 
Again, if the resulting chain is not simple, a circuit is deleted, and so on 
until a simple chain is obtained. Since the number of simple chains is 
finite, there is one with maximum gain. If with each vertex vp is asso- 
ciated the maximum gain g(v,) for a chain oriented from vp, then we have 
for two vertices vp, and v», joined by an are 


0 <|9(s,) — gr,)| < k 


for if the absolute value of the difference exceeds k — 1, then the max- 
imum gain at one of the vertices can be taken as that at the larger one 
minus a loss of k — 1 along the connecting arc. Thus g(vp) when reduced 
mod k yields a k-coloring of the vertices, since the values of g for adjacent 
vertices differ by less than k and hence cannot produce the same integer 
mod k. The four-color conjecture can now be stated as follows: The 
edges of a planar graph can be so oriented that the flow ratio of each circuit 
is <3. 


Trivalent Maps 


To show how the problem of coloring the regions of any map can be 
reduced to that of coloring the regions of a trivalent map (one which is 
regular of degree 3), we show how the vertices of the original map whose 
degree differs from 3 can be reduced to vertices of degree 8. This reduc- 
tion is accomplished by replacing any vertex of degree #3 with a closed 
polygonal region with as many vertices as there were edges incident with 
the original vertex. Each of the new vertices has one of these edges 
incident with it and is of degree 8. The resulting map is trivalent (see 
Fig. 4-12). When this map has been colored, a coloring of the original 
map is obtained by shrinking each of the new regions back to its original 
vertices. Thus if four colors suffice for coloring a trivalent map, then 
they also suffice for coloring the original map. 

We shall say that the regions or edges of a planar map have been 
properly colored if they have been assigned colors in such a way that no 
two adjacent regions or edges, respectively, have the same color. 

The following theorem, essentially due to P. Tait and discussed in 
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Enlargement of Enlargement of a Enlargement of a vertex of 
a vertex of degree 1 vertex of degree 2 degree 4 


Figure 4-12 


Ref. 3 and Ref. 34 of this chapter, relates the problem of 4-coloring the 
regions of a trivalent map with that of 3-coloring its edges. 

Theorem 4-25 If @ is a planar, trivalent, 2-connected (i.e., inar- 
ticulate) map, then the regions of @ can be properly 4-colored if and only 
if the edges of G can be properly 3-colored. 

Proof: Assume that the regions have been properly 4-colored with 
colors a, b,c,andd, Assign to each edge of G color 1, 2, or 3, according to 
the colors of the two regions which it bounds, as prescribed by the follow- 
ing table: 


a c 


[Rwwl| a 


b 
1 
3 
2 


(Note that the assumption that G is inarticulate is necessary in order to 
assure that each edge does in fact bound two different regions.) If two 
adjacent edges had the same color, it would follow that two adjacent 
regions had the same color. Since this is not the case, it follows that the 
edges have been properly colored. 

Conversely, assume that the edges have been properly colored with 
colors 1, 2, and 3. The subgraph determined by the edges colored 1 
and 2 is bivalent; hence, its regions can be colored with two colors A 
and B. (Why?) Similarly, the bivalent graph determined by the 
edges colored 1 and 3 has regions which can be colored with two colors 
C and D. When the latter two graphs are superimposed, every region 
of the original graph has one of the four color pairs AC, AD, BC, BD 
associated with it. If a different color is associated with each pair, the 
resulting coloring of the regions of G is readily shown to be proper. This 
completes the proof. 

Theorem 4-26, in conjunction with Theorem 4-25, establishes 
another characterization of the problem of properly 4-coloring the regions 
of a trivalent graph. This characterization is in terms of numbers asso- 
ciated with the vertices of the graph, and is due to Heawood.*” See also 
Ref. 4 of Chap. 1 and Ref. 3 of this chapter. 
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Theorem 4-26 Let G be a planar, trivalent, 2-connected graph. 
Then the edges of G can be properly 3-colored if and only if a coefficient 
k(v), equal to +1 or —1, can be associated with each vertex v in such a 
way that 

Dk(v) = 0 (mod 8) 


where the summation is taken over the vertices occurring in the bound- 
ary of any region of G. 

Sketch of proof: Assume that the edges of G have been properly 
colored with colors 1, 2, and 3. Assign to each vertex v the coefficient 
+1 or —1 depending on whether a clockwise movement about v results 
in encountering the edges incident with » in the color order 1, 2, 3 or 
1, 3,2. Now start with any edge e and traverse the edges which bound 
some region until ¢ is reached again, determining the color of each succes- 
sive edge by means of the color of the preceding edge and the coefficient 
of their common vertex. In order for the color ultimately determined 
for e to agree with its actual starting color, the mod 3 condition must 
be satisfied. 

Conversely, suppose a coefficient has been associated with each 
vertex in such a way that the mod 3 condition is satisfied, relative to every 
region. Assign color 1 to an arbitrary edge and sequentially color the 
remaining edges as follows: If an uncolored edge e is adjacent to a colored 
edge f, assign to e the color which is consistent with the color of f and 
with the coefficient of their common vertex, using the clockwise conven- 
tion given earlier. The fact that the resulting set of edge colors is proper 
follows from the mod 3 condition, which essentially guarantees that no 
color conflicts will arise during the course of this procedure and that the 
edges may be colored in any order. This completes the sketch of the 
proof. 

Theorems 4-25 and 4-26 can also be viewed in the dual context of 
triangular graphs, i.e., planar graphs such that every region (including 
the infinite region) is bounded by precisely 3 edges. The dual of a tri- 
angular graph is a 2-connected, trivalent, planar graph, and conversely. 
A proper 3-coloring of the edges corresponds, in the dual context, to a 
3-coloring of the edges of a triangular graph such that the three edges 
bounding any region have distinct colors. 


Exercise 4-14 Reinterpret the mod 3 condition of Theorem 4-26 in the dual 
context of triangular graphs. 


Theorem 4-27 (Three-color Theorem.) The regions of a tri- 
valent map can be colored with three colors if and only if every region 
is bounded by an even number of edges. 

Proof: “If” is proved by induction on the number of regions and 
hinges on the fact proved in Theorem 4-21 that there is at least one vertex 
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of degree no greater than 5. In the dual this implies that there is at least 
one region bounded by at most five edges, so there is a region bounded 
by exactly two or four edges. In the two-edge case let R, and R, be the 
regions adjacent to this region. If we eliminate one of the two bounding 
edges, absorb the region, and delete the vertices of degree 2, thus replac- 
ing their edges by a single edge incident with their adjacent vertices, by 
an inductive hypothesis on the number of regions we can 3-color this 
map. Insertion of the original region requires the third color that is 
not represented in either R, or Ry. The case for a region R bounded by 
four edges and adjacent to regions R,, Rs, Rs, Ry requires the removal 
of two opposite edges and also the removal of four vertices of degree 2. 
We assume that the edges removed bounded regions R, and R;. It is 
easy to show that all resulting regions have an even number of edges 
and that the resulting map is still trivalent and has two fewer regions. 
The region R; originally had an even number of neighboring regions, one 
of which was R. When the reduced graph is colored, the two neighbors 
of R, which were also neighbors of R, that is, R. and Ry, necessarily have 
the same color because the neighbors of R; must be alternately assigned 
two colors. Also, R, and R; have the same color because they are part 
of the same region in the reduced graph. When the two edges are 
replaced, R may be assigned the remaining third color. 

To prove the “only if” part, suppose that R is bounded by an odd 
number of edges and hence has an odd number of neighbors. The neigh- 
boring regions must be alternately colored with the two colors other 
than the color assigned to R, but this is impossible. 


Exercise 4-15 Dualize the foregoing theorem and give a proof in the dual 
context. 


Chromatic Polynomials 


Let P,(\) be the number of ways in which a map of n regions which 
covers the sphere can be colored with <) colors. The four-color con- 
jecture states that 4 is not a root of any equation P,(\) = 0. Two 
colorings are essentially different if they cannot be obtained from one 
another by a permutation of colors. If for the moment we disregard 
the possible permutation of colors and if m; (that is, 1, ... , m) is the 
number of ways of coloring a map with exactly 7 colors, then there are 
ma(A — 1) > + + (A — 4+ 1) ways to color the map with 7 colors taken 
from \ colors. Clearly, the total number of ways is an nth degree poly- 
nomial known as the chromatic polynomial, giving 


P(A) = mA + mad — 1) + °° +mdAA—1)°°* A-—n2+) 


Birkhoff* has given an explicit expression for this by finding the values 
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of my. It is 


n-t 


n 
P,(d) = > a ay (—1)*(4,k) 
i=1  k=0 
where (i,k) is the number of ways to break down the map of n regions 
to a submap of 7 regions by k simple or multiple coalescences (i.e., merging 
two or more regions, respectively, by removing edges). Thus (7,k) = 0 
for k > n — 7%, and by definition (n,0) = 1, (1,0) = 0 for i<m and 
(in — 7) is the number of ways to make n — 7 successive simple coales- 
cences. Thus, letting n = 3, we have 


(2,1) =3 (1,1) =1 (1,2) =3 
and 


P(A) = (3,0)A* — (2,1)a* + [— (1) + (1,2)]\ = AA — YA — 2) 


Exercise 4-16 Consider a map of five regions formed by a ring of three regions 
bounding an interior and an exterior region. Show that 


(41) =9 (31) =22 (3,2) = 51 
(21) =14 (2,2) = 125 (2,3) = 150 
Qj1)=1 (,2)=45 (1,3) =176 (1,4) = 150 
and P(A) = 4* — 94 + 298 — BOA? + 18A = AA — 1)(A — 2)(A — 8)? 


Birkhoff* has also shown that for a map on the sphere 
P,Q) >AA—DA—2A—3)% xn2e3,nA4 


which is obviously true for \ = 1,2,3 and with \ > 4 forn = 3,4. A 
map is defined to be maximal if every region in it touches (along an edge) 
the largest possible number of regions, i.e., there is no other map with 
the same adjacency of corresponding regions but with an additional 
regional adjacency. The necessary and sufficient conditions that a map 
of m > 3 regions be maximal are that all of its regions be simply con- 
nected (i.e., homeomorphie with a circle), its vertices be trivalent, and 
no two regions touch more than once. In such a map the number of 
adjacencies is 3n — 6. 

A map is irreducible if all its regions are simply connected, any two 
adjacent regions form a simply connected region, and any three regions 
that are adjacent in pairs form a simply connected region about a triva- 
lent vertex. For such a map 


TI Pp,(A) 


PAO) = em = DFO = OP 


mh<n 


where the polynomials P,,() belong to irreducible maps and are a + 6 + 
y +1 in number and n = 2n; — a — 26 — 3y. 
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A substantial characterization of P,(\) for different values of \ is 
given in Refs. 8 and 9. 


4-6 GRAPHS AND SURFACES 


One often avoids the definition of a surface by appealing to the intuition 
of the reader in a study in which a rigorous definition would not materially 
improve understanding. It has been our intention to follow this intui- 
tive appeal generally. But for the student who is curious about how a 
surface is defined, we shall now give some definitions which require famili- 
arity with ideas from topology, a subject which provides a basis for under- 
standing many modern mathematical concepts. 


A 1-1 transformation which together with its inverse is continuous is called a 
homeomorphism. An example of a homeomorphism is the continuous deformation 
of a pyramid into a sphere, and vice versa. Note that one cannot establish a 
homeomorphism between a sphere and a torus (a donut-shaped surface) because 
the latter has a hole. 

An n-dimensional topological manifold, or simply an n manifold, is a con- 
nected, locally compact topological space with a countable basis, each point of 
which has a neighborhood homeomorphic to euclidean n space." 

When a manifold is compact, it is said to be closed, for example, the n sphere; 
otherwise, it isopen. A closed 2 manifold that is homeomorphic to a polyhedron 
is called a closed surface. It is known that every closed 2 manifold is homeo- 
morphic to some polyhedron. An example of a closed surface is the 2 sphere, 
whereas the plane is an open surface. 

A geometric nsimplex 0”, n > 0, isthe set of points X = {z;j,2 =1,...,7, 
defined in terms of n + 1 linearly independent points Po... , Pa by x: = 

n 1” 

» a;p;i, where yy aj =1, 0<a;<1,j7=0,..., m, and p; is the ith 
j=0 j=0 

coordinate (i = 1,...,n) of pj (j =0,...,n). The n simplex is closed if 
0<a;<1,j =0,...,mholds, A geometric simplicial complex K is a finite 
set of disjoint n simplexes, n = 0, . . . , pin n-dimensional euclidean space such 
that if a simplex belongs to the complex, so do all its faces and no two distinct 
simplices have their faces all the same. The dimension of the complex is p. A 
polyhedron is the point-set union of all the simplexes of K and hence is the set of 
points which belong to some simplex of K. 

A surface S is unorientable if for some point around which is associated a 
direction of rotation it is possible to move the point on the surface while keeping 
the same orientation about the point and return to the initial position with a 
reversed sense of rotation. If there isno such point, the surface is called orientable. 

If the right end of the rectangle 


A B represented in Fig. 4-13 is given a half 
turn and then joined to the left end, 
| { we obtain a band which differs from a 


ring by the presence of the half turn. 
It is known as the Mébius strip. Note 
Figure 4-13 that it has only one side in that if one 
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starts at any point on this surface and traces a curve around the surface back to 
the point, one covers “both sides” without crossing an edge of the surface, and it 
can be seen to be unorientable. The sphere and the plane, on the other hand, are 
examples of two-sided orientable surfaces. 

A map is a graph together with a surface which contains the graph in such a 
way that edges intersect only at their end points. 


Euler’s formula actually applies to a polyhedron in three dimen- 
sions. In this case we have V — £ + F = 2, where V is the number 
of vertices, # is the number of edges, and F is the number of faces. The 
proof is obtained by cutting out one of the faces and stretching the poly- 
hedron on a plane without any intersecting edges. The proof proceeds 
exactly as in the planar graph case except that the outside region is not 
counted, Instead of it one includes the missing face which has been 
cut out. The following theorem, which applies to polyhedra, is given 
because of a similarity of the argument in the proof to that used in graph 
theory. 

Theorem 4-28 FEvery closed polyhedron has at least two faces 
with the same number of edges. 

Proof: Let e; be the number of edges of the 7th face where we order 
the labeling such that* e; < e41, i= 1,2,...,27—1, where r is the 
number of faces. If no two faces have the same number of edges, then 
G41 — €: > 1 forallz. Thus 


r=i 
& — a= yy (Gin1 — 6) >r—-1 
i=1 
Since e; > 3, we have e, > r + 2, that is, the rth face is adjacent to at 
least r + 2 faces, a contradiction. 

Remark: One can prove the stronger result that if k is the smallest 
number of edges of a face, then there are at least k indices ¢ for which 
ey = Ci41- 

Remark: Buler’s formula can be generalized to a polyhedron in n 
dimensions, where we use F’;, i = 0,1, . . . ,m — 1, to denote the number 
of ith dimensional faces; for example, /’y is the number of vertices. In 
that case Euler’s formula is 


n=-1 
y (-iF; = 1 - (-1) 
i=0 
where the right side is 0 or 2 depending on whether n is even or odd, 
respectively. 

Returning to three dimensions, we note that Euler’s formula for 
surfaces with holes is 


V-—E+F=2-—2p 
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where p is the number of independent holes and is referred to as the 
genus of the surface. The genus of the surface is the largest number of 
simple closed curves on the surface which do not disconnect the surface. 
The sphere is of genus 0, since any simple closed curve on its surface 
disconnects it. The torus (a donut) is of genus 1. Any surface of genus 
p is equivalent to a sphere with p handles (like the handle on a tea cup). 
For a nonorientable surface the right side of Euler’s formula reads 2 — q, 
where gq is the genus. 

To derive the formula, one may think of the sphere as a polyhedron 
with handles. A handle may be cut along one joint, leaving a poly- 
hedral-shaped hole in the surface left behind, and may then be pushed 
into the other joint, creating a similar hole. The reader can now show 
that V — 2 + F = 2, which holds for any subdivision of the entire 
sphere, must be reduced by 2p, which is the number of patches on the 
surface. 

We now derive Heawood’s formula which gives for p > 1 


cE + v1 Hp] 


as a sufficient number of colors for a map on a surface of genus p. The 
brackets indicate this is the greatest integer not exceeding the number 
being taken. Note that for a nonorientable surface the result is [(7 + 
~/1 + 24q)/2]. Both foregoing formulas give a lower bound to the 
number of colors when each is reduced by 2.** 

The same procedure of obtaining a trivalent map from a planar 
map can also be followed for a map on a surface of genus p. We shall 
be concerned only with connected maps, because it is not difficult to see 
that the coloring of a disconnected map is a special case of this. 

Now in a trivalent map we have 3V = 2H = aF, where a is the 
average number of edges for a given face. By substituting in Buler’s 
formula for a surface of genus p, we obtain 


12(p — 1) 
F 


We add 1 to each side, put a+ 1 = F, solve the quadratic in F, 
and take the positive root a to obtain Heawood’s formula. To prove 
that the greatest integer contained in @ gives the sufficient number of 
colors for a surface of genus p, we note that if F < a, then this number a 
is sufficient since F colors are sufficient, F being the number of regions. 
If F > a, then along the curve we always have a + 1 < a, as the reader 
will verify from Fig. 4-14. 

From the fact that a is the average number of sides of a face we 
conclude that there is a face R with at most a sides and thus it has at 


a=6+ 
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Figure 4-14 


most [a] — 1 sides, since a + 1 < [a]. If an edge of this face is removed 
so that the face is combined with one of its adjacent faces R’, then we 
can show that [a] colors are sufficient for the given graph if [a] colors are 
sufficient for the reduced graph. For suppose that the reduced graph 
has been colored with [a] colors; then since R’ and the other regions of 
the reduced graph which bounded R are at most [a] — 1 in number, 
R can be colored with the remaining available color. If the reduced 
graph has more than [a] regions, we repeat this argument, i.c., we show 
that to color it with [a] colors would follow if we could color the reduced 
graph obtained by merging another region having at most [a] — 1 sides 
with one of its neighbors. (Again, such a region must exist, since the 
average number of sides a still satisfies a + 1 < [a] because the number 
of regions F is still > [a].) Thus the coloring of the original graph with 
[a] colors eventually is made to rest on the possibility of coloring a reduced 
graph of [a] regions with [a] colors, which is clearly always possible. 
Note that each of the merged regions each having at most [a] — 1 sides 
can be reinstated and colored without conflict with its neighbors. This 
completes the proof of the sufficiency of [a] colors. 

Necessity has been proved for various values of p and requires the 
construction of special maps (see Ref. 2). 

The necessity of seven colors on the torus is illustrated by the map 
of Fig. 4-15. Heawood’s formula shows that seven colors are also suffi- 
cient. Here opposite edges must be identified to obtain a torus. 


Exercise 4-17 Show that six colors are necessary for some maps on a Mébius 
strip. 


An interesting theorem due to J. Edmonds is concerned with the 
relation between a graph G and the surface maps (S,@) to which it can 
give rise. The problem is to characterize maps on orientable surfaces 
which are the corresponding surface drawings of a given graph. 
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Figure 4-15 


It is intuitively clear that if we have a map (S,G) corresponding to a 
graph G, then the map determines at each vertex a unique cyclic ordering. 
It turns out that these cyclic orderings of the edges to each vertex are 
precisely the link between graphs and maps, since we also have the fol- 
lowing theorem: 

Theorem 4-29 (Edmonds.) For any connected graph G with an 
arbitrarily specified cyclic ordering of the edges at each vertex there exists 
a topologically unique oriented surface map (8,@), such that the clockwise 
edge orderings around each vertex are as specified. 


REFERENCES 


1 Aleksandrov, P. S.: ‘Combinatorial Topology,” vol. 1, Graylock Press, 
Rochester, N.Y., 1956. 

2 Ball, W. W. R.: “Mathematical Recreations and Essays,’’ The Macmillan 
Company, New York, 1960. 

3 Berge, C.: “Theory of Graphs and Its Applications,” John Wiley & Sons, 
Inc., New York, 1962. 

4 Birkhoff, G. D.: A Determinant Formula for the Number of Ways of 
Colouring a Map, Ann. Math., 14(2): 42-46 (1912). 

5 Birkhoff, G. D.: On the Number of Ways of Coloring a Map, Proc. Edinburgh 
Math. Soc., 2(2): 83-91 (19380). 

6 Birkhoff, G. D.: The Coloring of Graphs, Ann. Math., 33(2): 688-718 (1932). 

7 Birkhoff, G. D.: The Reducibility of Maps, Am. J. Math., 35: 115-128 (1913). 

8 Birkhoff, G. D.: On the Polynomial Expressions for the Number of Ways of 
Coloring a Map, Ann. Scuola Normali Superiore, Pisa, Ser. 2, 3: 85-104 (1934). 

9 Birkhoff, G. D., and D. C. Lewis: Chromatic Polynomials, Trans. Am. Math. 
Soc., 60: 355-451 (1946). 

10 Brooks, R. L.: On Colouring the Nodes of a Network, Proc. Cambridge Phil. 

Soc., 37: 194-197 (1941). 


Planar and nonplanar graphs, coloring theorems 97 


11 Dynkin, E. B., and W. A. Uspenski: “Multicolor Problems,” D. C. Heath 
and Company, Boston, 1952. (Original in German.) 

12 Dirac, G. A.: Note on the Colouring of Graphs, Math. Z., 54: 347-353 (1951). 

18 Dirac, G. A.: A Property of 4-Chromatic Graphs and Some Remarks on 
Critical Graphs, J. London Math. Soc., 27: 85-92 (1952). : 

14 Dirac, G. A.: Some Theorems on Abstract Graphs, Proc. London Math. Soc., 
Ser. 8, 2: 69-81 (1952). ‘ 

15 Dirac, G. A.: The Structure of k-Chromatic Graphs, Fund. Math., 40: 
42-45 (1953). 

16 Dirac, G. A.: Map Colour Theorems Related to the Heawood Colour 
Formula, J. London Math. Soc., 31: 460 (1956). 

17. Dirac, G. A., and M. D. Stojakovic: The Four-colour Problem, Matematicka 
Biblioteka, 16: 1960. MR 22-1046. 

18 Erdés, P., and N. G. de Bruijn: A Color Problem for Infinite Graphs and a 
Problem in the Theory of Relations, Indagationes Math., 13: 369-373 (1951). 

19 Errera, J.: Du Coloriage des Cartes, these, Bruxelles, 1921, Mathesis, 36: 56 
(1922). 

20 A problem in Zureka, October, 1960. 

21. Fary, I.: On Straight Line Representation of Planar Graphs, Acta Sci. Math., 
11(4): 229-233 (1948). 

22 Franklin, P.: The Four Color Problem, Scripta Mathematica, No. 5, 1941. 

23 Goodman, A. W.: On Sets of Acquaintances and Strangers at Any Party, 
Am. Math. Monthly, 66: 778-783 (1959). 

24 Greenwood, R. E., and A. M. Gleason: Combinatorial Relations and Chro- 
matic Graphs, Can. J. Math., 7: 1-7 (1955). 

25 Griinbaum, B., and T. 8S. Motzkin: On Polyhedral Graphs, Proc. Symp. Pure 
Math., vol. 7, Convexity, 1963. 

26 Harary, F.: A Complementary Problem on Nonplanar Graphs, Math. Mag., 
35: 301-303 (1962). 

27 Heawood, P. J.: Map Color Theorem, Quart. J. Pure Appl. Math., no. 24, 
p. 332, 1890. 

28 Kneebone, G. T.: The Three Houses Problem, Math. Gaz., 25: 78-81 (1941). 

29 Krausz, J.: Demonstration Nouvelle d’une Théoréme de Whitney sur les 
Reseaux, Mat. Fiz. Lapok, 50: 75-85 (1943). (In Hungarian.) 

30 Kuratowski, G.: Sur le Probléme des Courbes Gauches en Topologie, Fund. 
Math., 15-16 (1930). 

31 Linis, V.; Math. Mag., 36(4): (1963). 

32 Lorden, G.: Blue-Empty Chromatic Graphs, Am. Math. Monthly, 69(2): 
114-119 (1962). 

33 MacLane, 8.: A Combinatorial Condition for Planar Graphs, Fund. Math., 
28: 22-32 (1937); Zbl. Math., 15, 375. 

34 Marathe, C. R.: On the Dual of a Trivalent Map, Am. Math. Monthly, 
68(5): 448-455 (1961). 

35 Minty, G. J.: A Theorem on n-Coloring the Points of a Linear Graph, Am. 
Math. Monthly, 69(7): 623-624 (1962). 

36 Nordhaus, EB. A., and J. W. Gaddum: On Complémentary Graphs, Am. Math. 
Monthly, 63: 175-177 (1956). 

36a Picard, C.: Graphes Complementaires et Graphes Planaires, Rev. Franc. 
Recherche Operationelle, 8: 329-343 (1964), 

37 Rapaport, E. $.: Cayley Color Groups and Hamilton Lines, Scripla Math., 
24: 51-58 (1959). 


98 Basic theory 


38 Ratib, I., and C. E. Winn: Généralisation d’une Réduction d’Errera dans 
le Probléme des Quatre Couleurs, Intern. Congr. Math., Oslo, 1936. 

39 Ringel, G.: “Farbungsprobleme auf Flichen und Graphen,”” VFB Deutscher 
Verlag der Wissenschaften, Berlin, 1959. 

40 Sauvé, Léopold: On Chromatic Graphs, Am. Math. Monthly, 68: 107-111 
(1961). 

41 Seshu, S., and M. B. Reed: “Linear Graphs and Electrical Networks,” 
Addison-Wesley Publishing Company, Inc., Reading, Mass., 1961 

42 Tietze, H.: Einige Bemerkungen tiber das Problem der Kartenfarbens auf 
einseitigen Flichen, Jabresber., Deutsh. Math. Vereinigung, 19: 155-159 (1910). 

43 Zeidl, B.: Uber 4 und 5-chrome Graphen, Monatsh. Math., 62: 212-218 
(1958). 


FIVE 


MATRIX REPRESENTATION 


5-1 INTRODUCTION 


In this chapter we introduce various matrices to represent inci- 
dence relations between the vertices and edges of a graph and, 
more generally, to represent incidence relations between circuits, 
cut-sets, chains, and the edges associated with them. These alge- 
braie representations apply to both directed and undirected 
graphs. Whether one manipulates these matrices or not, they 
are a convenient way to record the structure of the graph. There 
is a close relationship between the incidence, circuit, and 
cut-set, matrices whereby it is possible to obtain a basis for a 
certain vector space associated with one in terms of a basis asso- 
ciated with the space for another. This, for example, facilitates 
the task of determining all the cut-sets of a graph once its circuits 
are known. Properties of the cut-sets are closely related to the 
maximal flow through a network, as we shall see in Chap. 7. The 
notion of a tree, its branches, and chords play an important role 
in the formulation of basic matrices for circuits and for cut-sets. 
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In the case of undirected graphs we shall be dealing with incidence 
matrices whose entries are 0 and 1. The addition of numbers is always 
reduced mod 2. Thus, 1+1=0 (mod 2), 1+0=0+1=1, 
0+0=0. One simply adds the elements, divides by 2, and records 
the remainder as the result of applying the mod 2 operation. This 
choice of entries enables one to indicate whether a property between 
two elements holds (with entry 1) or does not (with entry 0). Matrices 
may be added and multiplied as usual, and then the entries of the result- 
ing matrix are recorded mod 2. For directed graphs we use 1, —1, 0, 
since an are, for example, may be incident with a vertex and directed 
away from the vertex, directed toward it, or not incident with it at all. 
The mod 2 reductions are no longer adequate in this case, and we must 
consider matrices with integer-valued entries. In fact, they are more 
cumbersome to deal with than the mod 2 matrices. However, basically 
the same kind of theory is obtained for the matrices of a directed graph 
as is obtained in the undirected case. (It is assumed here that the reader 
has a working knowledge of elementary matrix theory—the bare essen- 
tials.) With the idea of a matrix, one associates the concept of a vector 
space and related theory revolving about bases and linear combinations. 

Definition A real vector space is a set of elements (called vectors) 
with a rule for addition associating with the vectors « and y a unique 
vector x + y and another rule for scalar multiplication by real numbers 
(or the elements of any other field, such as the complex numbers or the 
integers modulo 2) associating with a vector x a unique vector az for 
every scalar a. 

In order to have a vector space, the following axioms must be 
satisfied for all «, y, z in the space and for all scalars a, b: 

The vectors must form a commutative group under addition, i.e., 


Lety=yto 

22+ yte2 =@+y) +2 

3. 0 + x = x for all x, where 0 is the zero vector 

4. Each vector x has an inverse y such that « + y = 0. 


It must also satisfy under scalar multiplication: 


5. a(ba) = (ab)a 

6. (a + b)a = ax + ba 

7. a(x + y) = ax + ay 

8. 1+ « = a for every x, where 1 is the multiplicative unit scalar 

9. There is a set of vectors which forms a basis, i.e., they are linearly 
independent and span the space (every vector can be expressed as a 
linear combination of them), Every basis has the same number of 
vectors, known as its rank, 


Two vector spaces V and U are isomorphic if there is a 1 — 1 cor- 
respondence between their elements which preserves addition and scalar 
multiplication. That is, if vy; and ve are in V and uw, and uz are the cor- 
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responding elements in U, then v; + v2 corresponds to uy + wu. Simi- 
larly, kv corresponds to ku, where v corresponds to u and where k is any 
scalar.® 

In the case of an undirected graph the elements of the vector space 
under consideration in our matrix representations are vectors, each of 
which represents a subset of the edges, whose components are 1 or 0 and 
indicate whether an edge belongs to the subset or not. In the case of 
an undirected graph having m edges, the vector space of interest con- 
sists of all vectors (1, . . . ,2m), Where 2; = 0 or 1. The ith component 
is associated with the 7th edge, and each vector uniquely characterizes a 
subset of edges, namely, those for which x; = 1. In particular, each 
circuit and cut-set has a vector representation. The sum of two such 
vectors is again a vector of the same form, and their scalar product (the 
sum of the products of corresponding components) is 0 or 1 depending on 
whether the two subsets corresponding to these vectors have an even 
or odd number of common edges. 

The set of all linear combinations of circuit vectors (i.e., vectors 
which correspond to circuits) determines a subspace consisting precisely 
of those vectors which correspond to circuits and to edge-disjoint sets of 
circuits, and similarly for vectors associated with cut-sets. 

An important and difficult question has been answered in recent 
years, namely, when does a given matrix represent a basis for the space 
associated with the circuits or cut-sets of a graph? This is the problem 
of the realizability of a matrix. We shall go into this question only 
briefly, because it involves considerable background material that is out- 
side the scope of this book. 


5-2 THE INCIDENCE MATRIX 


Let @ be a graph having n vertices and m edges. The incidence matrix 
associated with @ is the n X m matrix whose rows and columns cor- 
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Figure 5-1 
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respond to the vertices and edges, respectively, where each entry a, is 1 
or 0 depending on whether the jth edge is incident with the 7th vertex or 
not. For a loop, however, every column entry is 0, For example, the 
2-component graph shown in Fig. 5-1 has the following incidence matrix: 


€; €2 €3 G4 @5 6 C7 H €s @) 10 C1 

2) (0'.1,..0), 0, 0) 10, 0:0) (0: 0 40 
velO; Aaa gO: O10) Te Os OO on0 
yail0) 80s io Tis Oped 10" 0" (OF. Oeero 
A= v),]00 0. 30) 1) 1) 05 100)'0), (OROas0 
o5|0 10 0) (01211 0s 0 0e 40 
14:10" 0) dOtetOy (O08 4011 L. 40° 1Oeeel 

7 Ode AO Oe OnnO, SOs lhl) AG 

tig (O» (Ore 10s 0% 0) 10! 00 \40! dd, 


Some interesting properties of a graph are observed in its incidence 
matrix. For example, since an edge of the graph is incident with exactly 
two vertices, every column of its incidence matrix contains two unit entries. 
A loop, since it is (twice) incident with a single vertex, is the only 
exception. Hence its column has zero entries. Thus the incidence 
matrix gives no indication of the location of loops, because one does not 
know which zero corresponds to a vertex. Hence it is advisable in the 
matrix study of graphs to exclude loops, as we shall henceforth. 

If the edges and vertices are appropriately indexed, each component 
of a graph corresponds to a submatrix of a block partitioning of the 
incidence matrix as follows: 


ae 0 
Aandi Gtoekien 
0 0 An 


where A; is the incidence matrix associated with the ith component. A 
block-diagonal representation of this type can always be accomplished 
either by sequential labeling within each component and from one com- 
ponent to the next as illustrated in the example or by rearranging the 
rows and columns of the incidence matrix, which, of course, produces the 
same effect. We can now state that two graphs are isomorphic if they 
have the same incidence matrices to within a permutation of rows and 
columns. Thus, the incidence matrix provides a complete representation 
of a graph (if loops are excluded). 


Exercise 5-1 Show the equivalence between this definition of isomorphism and 
that given in Chap. 1, 
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Theorem 5-1 The rank of the incidence matrix of a p-component 
graph with n vertices ism — p (where it is understood that arithmetical 
operations are performed mod 2). 

Remark: The rank may be quite different for the incidence matrix 
considered as a matrix of real numbers, for example. 

Proof: Let n; be the number of vertices belonging to the component 
P;(i=1,..., p) which is represented by the submatrix A;; we shall 
show that A; has the rank n; — 1. It is easy to show that the rank of A; 
is at most equal to nm — 1. This follows from the fact that the sum of 
all the rows of A; is zero mod 2, since each column has two nonzero ele- 
ments, and hence if the first m; — 1 rows of A, are added to the last row, 
the result is a row with zero elements everywhere. Since the rank of a 
matrix is unchanged by the elementary operation of row addition, the 
rank of the new matrix which has a row consisting of zeros cannot exceed 
nm — 1. The sum of any n; — 1 or fewer rows of A; must have some non- 
zero element; for if all the elements are zero, then these rows form the 
submatrix of a component, since none of the vertices corresponding to any 
of these rows would be connected by an edge to another vertex not in this 
set, which contradicts the fact that the component P, is a maximal con- 
nected subgraph and proves that the rank of P; is n; — 1, since every set 
of n; — 1 rows is independent. 

As another method for showing that the rank of A; is exactly n; — 1, 
we may start with the first row of A; and, by permuting its columns, obtain 
for the main diagonal element in the upper left-hand corner a nonzero ele- 
ment. By the addition of the first row to any row which has a nonzero 
element in the first column, all elements in the first column except the first 
become zero, The second row must have another nonzero element which 
by permuting columns is brought to the (2,2) position and all elements in 
the second column are similarly reduced to zero. The process is repeated 
for the third row, ete., until a diagonal matrix is obtained. As indicated 
above, all the elements of the last row can be made to equal zero. This 
gives rise to a triangular matrix with n; rows which with the exception of 
the last element has unit entries on its principal diagonal. Since these 
operations do not influence the rank, the rank 
of A; must be n; — 1. Thus the rank of A is 43 
n — p, where p is the number of components. 

As indicated previously, the coefficients 
of the incidence matrix of a directed graph 
consist of —1, 0,1. The coefficient is zero 
if a vertex is not incident with an are, +1 
if the arc is oriented away from the vertex, 2 
and —1 in the opposite case. Thus the 
graph of Fig. 5-2 has the incidence matrix Figure 5-2 
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3/1 0 
Vo 1 1 
03 0 —1 


The matrix of a directed graph with n vertices and p components also has 
rankn — p, The term “rank” is associated with a graph that is directed 
or not, and refers to the rank of the incidence matrix. 


Exercise 5-2 Form the incidence matrix of the 2-component directed graph 
given in Fig. 5-3, after indexing its ares. 


5-3 THE CIRCUIT MATRIX 


The circuits C; of our graph (labeled as in Fig. 5-4) have the following 
cireuit matrix C, each row of which characterizes one of the circuits. 


(Here it is a coincidence that each edge except e2 belongs to just two 
circuits.) 


€2 €3 €4 @5 €6 C7} Cg Cy C19 C1 


G/Ou 0k aie ly 00} CMOmIOmmO 
Cy fie 1 0 0! Dd LN0n,0) sono 
eee fe Re pln Teele 10 10 ea? 
CoN 10) 0) OV 0NtO) Oli) nad 
MeNe.0 10 OO), O10! 410% gi palo 
Ci NO 10) 50) 0 OmcOniaib Omen 


The reader will observe that the circuit matrix is also block-diagonal, 
which is a consequence of the fact that every circuit is contained entirely 
in one component, and the edges of each component are numbered con- 
secutively to avoid the necessity of permutation. 
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Figure 5-3 
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Figure 5-4 


Theorem 5-2 The incidence matrix A and the transpose C’ of the 
circuit matrix C, when arranged using the same order of edges, are orthog- 
onal, that is, AC’ = 0. 

Proof: A sketch of the proof of this theorem is as follows: Each vertex 
incident with a circuit is incident with an even number of edges that are 
also in the circuit. Thus when a row of A is multiplied by a column of 
C’, we either obtain a sum of elements each of which is zero (if a vertex is 
not in a circuit) or a sum which is =0 mod 2. 


Exercise 5-3 Apply Theorem 5-2 to the incidence matrix (without the loop) 
and the circuit matrix given above. 


Remark: If a graph is disconnected, then the incidence, circuit and 
cut-set (see below) matrices of each component may be treated separately, 
because none of the properties studied below depend on the fact that there 
is more than a single component. For this reason we limit our study to 
connected graphs, that is, p = 1 with m edges and n vertices and without 
loops. 


5-4 THE CUT - SET MATRIX 


We now write down the proper cut-sets of the graph of Fig. 5-5, thus 
forming the cut-set matrix, each row of which characterizes one proper 
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Figure 5-5 
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cut-set. The proper cut-sets are 


Ki = {e2} Ka = {ese;} Ka = {exes} Ke ={es,04,¢0} 
Ks = {¢s,¢5,€6} Ke = {¢4,€6,¢7} Kz = {e5,¢6,¢7} 


and the cut-set matrix is given by 


" 


€o 3 C4 5 eg er 

Ki{1 0 0 0 0 0 
K,}0 1 0 0 0 1 

Ka lOpOin det 0 © 
E=Ke|O 1.1 0 1 0 
Big 0: Vili Ore ach) 
Be} OyD Oeikod 
KeO! OP lay 


We have the following orthogonality relation between the cireuit 
and cut-set matrices arranged with the same order of edges: 

Theorem 5-3 The circuit matrix C and the cut-set matrix K of any 
graph satisfy the relationship CK’ = 0. 

Proof: The theorem follows from the fact that a cut-set always has 
an even number of edges (which may be zero) in common with every 
circuit. 

Thus since a proper cut-set separates the graph into two parts, it 
either divides a circuit or it does not. In the latter case the number of 
common edges is zero; in the former case an even number of edges of the 
cut-set must belong to the circuit, by Theorem 1-8. 


Exercise 5-4 Show that if a graph is connected and inarticulate, then its cut-set 
matrix contains its incidence matrix, 


Remark: Yor planar graphs the problem of enumerating the cut-sets 
is equivalent to that of enumerating the circuits in the dual graph. 

We now indicate a procedure for deriving the basis matrix for the 
circuits of a connected graph. We first designate a complete tree of the 
graph. In the following discussion all trees are understood to be spanning 
trees. The tree contains » — 1 branches and m — n+ 1 chords. We 
have from the definition of a tree that to each chord corresponds a circuit 
of the graph, namely, the chord and the unique chain in the tree joining 
its end points. Each circuit has at least one edge not in any of the 
remaining circuits. Thus, the vectors corresponding to these m — n + 1 
circuits are linearly independent and the rank of the circuit matrix is at 
least m — n+ 1. We now show that equality holds and hence that the 
circuits determined by the chords form a basis for all circuits of the graph. 

Here we make use of the following theorem from matrix theory: The 
rank R of the product of two matrices, the first of order p X q and the 
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second of order q X 7 with ranks hk; and hs, respectively, satisfies the 
inequality 
R2h+k—q 


If the two matrices are orthogonal, then R = 0 and ky <q—k:. We 
take the incidence matrix as the first matrix and the transpose of the 
circuit: matrix as the second. Since the rank of the incidence matrix is 
ky = n — 1 and q = m is the common order of the circuit and incidence 
matrices, it follows that for ko—the rank of the circuit matrix—we have 


ke m—n+1 
However, as we have seen above, kz > m — n + 1 and hence 
ke=m—n+1 


We have proved the following theorem: 

Theorem 5-4 The rank of the circuit matrix of a connected graph 
with m edges and n vertices is m — (n — 1). 

If the graph has p components, then the rank of the circuit matrix 
ism —n-+ pp. This number is also known as the cyclomatic number of 
the graph. 


Exercise 5-5 Use Theorem 5-4, on the rank of the product of two matrices, to 
prove that the rank of the cut-set matrix is at most n — 1. 


Remark: There is a 1-1 correspondence between nonsingular sub- 
matrices of rank n — 1 of the incidence matrix of a connected graph and 
the trees of the graph. A similar correspondence exists between the 
submatrices of rank m — n+ 1 of the cireuit matrix and the comple- 
ments of the trees of the graph. 

We shall motivate the first part of the remark. The incidence matrix 
of a spanning tree is a submatrix of the incidence matrix of the entire 
graph. It has all the vertices (rows) but only n — 1 edges (columns) 
of the tree. Since a tree is a connected graph, its matrix has the rank 
n—1. Conversely, a submatrix of 
rank n — 1 corresponds to n rows 
and m — 1 columns. Since the col- 
umns are linearly independent, all 
vertices are connected and we have 
a tree. 

To illustrate the method of gen- 
erating a basis matrix for the cir- 
cuits, we again consider the graph 
given in Fig. 5-6. The bold edges 
indicate a selected tree whose chords 
are e,and e;, Figure 5-6 
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The circuit-basis matrix, which 
is a submatrix of the circuit matrix, 
% ¢3 includes just those circuits deter- 
mined by the chords of the selected 
tree and is given by 


¢5 e, C4 ey H €2 63 €5 
ae, Cx fie O! inGie0) 1 a! 
*s Cha ROMY Ord 


which has the form 
Figure 5-7 é= (1\Cs2) 


where J is the identity matrix determined by the incidence of the chords 
with the cireuits and Cy» is the corresponding submatrix for the branches, 

In the case of a directed graph we introduce orientations of the 
circuits as shown in Fig. 5-7, where the orientation of each basis circuit 
is induced by the inherent direction of its associated chord. In this case 
the circuit-basis matrix has the form 


Note that —1 occurs where the circuit orientation is opposite to that of 
the are direction. 
Since the rank of the incidence matrix A is n — 1, we can write 


An Ay 
4=( 
An Ass 
where A, is a square nonsingular submatrix of dimension n — 1. We 
have seen that such nonsingular matrices correspond to trees, and hence 
Ay is the submatrix corresponding to a tree which, for example, may be 
the tree chosen to construct C above. 


_ From Theorem 5-2 we have AC’ = 0, from which it follows that 
AC’ = 0, where C is the circuit-basis matrix. The last relation may be 


written as 
ts Ad) 
An An, Ce 


where the edges of A must be ordered in the same way as those of C. 
Multiplying out yields 


Aul + AnCy, = 0 
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Since Aj, is nonsingular, we have 
Cro = —Ayw An = AwtAn 
since —1 = 1mod2. Thus we construct C from A. 


Exercise 5-6 Use this method to obtain C from A for the graph of Fig. 5-6. 
Let the first two columns correspond to the chords. Note that Ai. must correspond 
to the tree. 


A basis matrix for the cut-sets is obtained by taking a tree and con- 
sidering each of its branches at a time. The removal of a branch breaks 
the tree into two subtrees and partitions the vertices accordingly. The 
branch and any chords incident with both subtrees comprise a basis 
cut-set. (In directed graphs, —1 reflects opposition to the branch’s 
direction.) 


Exercise 5-7 Use the same tree we used to derive the circuit-basis matrix to 
show that the cut-sets obtained in this manner are Ki, Ks, Ks, Ko. (See page 106.) 


From a cireuit-basis matrix C it is possible to obtain the cut-set- 


basis matrix K, and conversely. By adding rows and permuting columns, 
K can be partitioned as follows 


K= (Ku\D) 


where K,, is the submatrix corresponding to the chords and J is the 
identity submatrix corresponding to the branches. From the ortho- 
gonality relation which exists between the circuit matrix and the trans- 
pose of the cut-set matrix we have CK’ = 0. Thus 


(1|C12) (#) =0 
hence Ki, + Cx = 0 
from which we have Kj, = C12, since —1 = 1 mod 2. 
Exercise 5-8 Use this result to obtain Kj, from Cy» in Exercise 5-6 and verify 


the equivalence of the answer with that of Exercise 5-7. 


Enercise 5-9 Write down the incidence, circuit, and cut-set matrices of the 
complete graph having five vertices. 


Exercise 5-9a Give the ranks of the dual incidence, cut-set, and circuit matrices 
and construct these matrices for the foregoing examples. 


5-5 THE VERTEX OR ADJACENCY MATRIX 


We can also define a vertex (or adjacency) matrix for both directed and 
undirected graphs. The element in the (i,j) position of the matrix is 
equal to the number of edges incident with both vertex i and vertex j 
(or directed from vertex ¢ to vertex j in the directed case). Thus for the 
directed graph of Fig. 5-3 (see also Exercise 5-2 on page 104), we have the 
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following vertex matrix: 


V7 


ccooocoores 
cocororocs 
coococcos 
cooocroos 
cooorrwroos 
oooooooc’;, 
eoorocecos 
SCnNnroococes: 


Ug 


In general, we have the following theorem regarding the vertex matrix V 
of a graph: 

Theorem 5-5 The matrix V" gives the number of are progressions 
of length n between any two vertices of a directed graph. 

Proof: If ay is the number of ares joining v; to 2% and a,; is the number 
of ares joining x to v;, then a.a,; is the number of different paths each 
consisting of two ares joining »; to v; and passing through 2. If this is 
summed over all values of k, that is, over all the intermediate vertices, one 
obtains the number of paths of length 2 between v; and v;. If we now use 
a,; to form 4;;jm, we have the number of different paths of length 3 between 
v; and v» passing through v;, and so on. Thus if we assume the theorem 
true for V"—!, then the coefficients of V" = V"—1V give the number of 
paths of length » between corresponding vertices. This completes the 
proof. 

Remark: Obviously, if for some N, V" = 0 for n > N, then there are 
no cycles in the graph. If the graph has no cycles, then the elements of 


V* give the number of simple paths between any two vertices of the graph. 


Exercise 5-10 Prove a similar theorem for undirected graphs. Denote their 
matrix by V. 


Remark: We now discuss the use of the vertex matrix as a tool in 
investigating the properties of a binary relation F in a finite set 


8 = {ay,,....« ,Gn} 


Let V = (v;;) be the vertex matrix associated with R. If R is reflex- 
ive (irreflexive), V has 1’s (0’s) down its main diagonal. If R is sym- 
metric, V is a symmetric matrix. If 2 is asymmetric, then V* will have 


the same elements along its main diagonal as V has, since the only path 
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of length 2 from a; to a; is that obtained by traversing a loop at a; twice 
(if one exists), unless there is some other element a, such that a;Ra, and 
a;Ra;. {In the latter case, there will also be an extra path of length 2 


from a, to a;, and the V* = (»,;‘®) matrix will have elements vj = v,; + 1 
and »,; = v, + 1.] 

Transitivity is also easily determined. The graph-theoretic state- 
ment of this property is the following: If there exists a path of length 2 
from a, to a;, then there exists an are from a; to a;. Thus, it is only neces- 
sary to check whether »,; is positive whenever v;; is positive. If so 
then # is transitive. 


The Index of Primitivity 


Note that the vertex matrix in both the directed and the undirected 
case has nonnegative entries. Zero entries imply that the vertex cor- 
responding to the row of the entry is not conneeted by an edge to the 
vertex corresponding to the column. However, these vertices may be 
connected by a path (or chain) of a certain length. It may be possible 
that V" > 0 for some integer m > 1; that is, after V is raised to the power 
m, all its entries become nonzero. In that case every vertex is accessible 
to every other vertex in are progressions of m ares in length, and the 
graph corresponding to V” is complete (in fact it has an are directed 
from any vertex to any other vertex) and has a loop at every vertex. 

Let us now consider a directed graph D. If D is strongly connected, 


then its vertex matrix, which we denote by V, is said to be zrreducible. 
In general, a nonnegative square matrix U (from which we form a 
directed graph according to whether an entry is zero or positive) need 
not be irreducible, If it is irreducible, i.e., the graph is strongly con- 
nected, and if U™ > 0 for some integer m > 1, then U is called primitive 
(and so is its associated directed graph) and the smallest integer m for 
which this is true (i.e., every vertex is accessible to every other vertex 
by an are progression of m edges) is known as the index of primitivity 
of U and is denoted by y(U). We now give the preliminaries in order 
to show that an irreducible matrix is primitive if and only if the greatest 
common divisor of the lengths of all its simple cycles is unity (Ref. 73 
of Chap. 6). 

Definition If H is a subgraph of a graphG, the number of vertices 
in H is said to be the order of H. 

Definition I{ P is an are progression in a directed graph D, the 
number of ares (counting repetitions) in P is said to be the length of P. 

Lemma 5-6 If D = (V,A) is a primitive graph with index of 
primitivity m, and if v,w eV, then for all integers g > m there exist an 
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are progression of length g from v to w and an are progression of length q 
from w to v. 

Proof: Observe that if U is a primitive matrix, then every row and 
every column of U must contain at least one nonzero element. It then 
follows that the product UU contains only positive elements. In 
fact, the product U"*" - U = Us, where nis any positive integer, contains 
only positive elements. 

Theorem 5-7 A necessary and sufficient condition for a strongly 
connected graph D = (V,A) with n > 2 vertices to be primitive is that 
the GCD (greatest common divisor) of the lengths of all simple eycles in 
D is unity. 

Proof, sufficiency: Suppose D = (V,A) is a strongly connected graph 
with nm > 2 vertices. Suppose D has r simple cycles with orders Pr; Pr, 

. , pr and suppose that the GCD of these orders is unity. Choose a 
closed are progression P such that P passes through all the vertices of D. 
We need not be concerned with which vertex we start and terminate; 
the symbol P will denote any progression equivalent to the one first 
chosen in the sense that the ares are encountered in the same order even 
though the initial (and final) vertices may be different. Let P have 
length 7. 

Since D is strongly connected, there exists, for every vjwe V, a path 
from » to w. With each such ordered pair (v,w) we associate a positive 
integer 7 < n giving the length of some path from v to w. Now if we 
wished, we could “travel” from v to w by (1) first traversing P (from » 
to v) and, at various predetermined vertices, taking “detours” around 
simple cycles in which these vertices lie and (2) subsequently travers- 
ing the path with 7 ares. The length of this trip can be expressed by 


r 
s=jt >) expe + %, where ey is a nonnegative integer for each k. We 
k=l 


shall show that, for each 7, the e can be so chosen that s is constant, 
i.e., it does not depend on 7 in that e, are chosen to help eliminate 7. 

We recall that if K is a set of integers whose GCD is 1, then 1 can 
be written as a linear combination of these integers. Hence we can 
choose positive integers a, a2, ... , a;, which may be placed in two 
complementary sets X and Y such that 


Y ape — ¥ avpe = 1 
eX ke 
Now for each integer 7, 1 <i <n, and each integer k, 1 < k <r, we 
define (for a sufficiently large integer m) 
(m +1 — t)ay ifkeX 
(i — 1), ifke Y 


Ck = 
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Then » CnDe +t x (m + 1 — ita, + > (@ — ldap, +7 


m Xam “+ damn = Pan raat » UPe 
+2) ape +i=m yap +1 
ke. 


keY 


i] 


which is a constant. 
Hence there exists a fixed integer s = 7 + m Dy ape + 1 such that 
ke. 


for every ordered pair v,w« V there exists an are progression of length s 

from v to w. Hence there is an integer s such that if U is the vertex 

matrix of D, then U* > 0. (To see how a given desired path of length s 
r 


is realized, recall that s = 7 + M CxPr + 2.) 
kal 


Necessity: Suppose that D = (V,A) is a primitive graph with 
n > 2 vertices and index of primitivity m. By hypothesis (actually, 
by primitivity) D is strongly connected. Choose a closed are progression 
P (from some ve V to v) such that every simple cycle in D is traversed 
at least once and ¢ > m, where ¢ is the length of P. Since P can be 
divided into a succession of simple cycles (generalize Exercise 2-3 to a 
closed are progression) ¢ may be written as a linear combination of the 
orders pi, pz, . . - , pr of the r simple cycles, where all the coefficients are 
positive. Now by Lemma 5-6 there exists a closed are progression of 
length ¢-+ 1 from v tov. Then (by the lemma) there exist closed are 
progressions R (twice P) and S of lengths 2¢ and 2¢ + 1, respectively. 
Notice that both 2¢ and 2¢ + 1 can be written as linear combinations of 
the p; (i = 1, 2,. . . , r) with strictly positive coefficients. The above 
generalization of Exercise 2-3 can be applied to the closed arc progression 
of length r + 1, and S, which has length 


t+i+1=2t+1 


can be taken as the desired linear combination.) But to express two 
consecutive integers as linear combinations of the same set of integers 
pi is possible only if the GCD of the p; is unity. This completes the proof. 

An undirected example of nonprimitivity is a bipartite graph, a 
special case of which is the lattice graph, such as the boundaries of the 
squares of the chess board, in which some pairs of vertices are joined only 
by chains having an even number of edges and other pairs are joined 
only by chains with an odd number of edges. (This fact is true of all 
bipartite graphs. Prove this as an exercise.) For these graphs all simple 
cycles have an even number of edges and do not meet the definition of 
primitivity, since 2 is a common divisor. 
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The tth power of the matrix V of a directed graph D is a matrix with a 
directed graph having the same vertices and such that the ordered pair 


(v;,0)) is an edge of yi if and only if there is an are progression in Vv from v; 
to v; of length ¢.5 


Exercise 5-11 Show that if v is primitive, then so is ve for t > 0. 


Exercise 5-12 Show that, in a primitive graph, with every vertex v; there is 
associated an integer h such that for every vertex vs there is a path of length h from vy 
to vj. The smallest value of h is called the reach of v; and is denoted by hy. 


Exercise 5-13 Show that, in a primitive graph, if p > hy there is an are progres- 
sion of length p from x; to any v;. (Note that because the graph is strongly connected, 
there is an are from some v; to vj and hence a path from »; to v, to vy of length hy + 1, 
etc., by induction.) 


Exercise 5-14 Show that, in a primitive graph D, »(D) = max [hi, . ~ . shal- 


Exercise 5-15 If there is a loop at the vertex v; of a strongly connected graph, 
then h; <n —1. 


The notion of the index of primitivity is important. Consider, for 
example, a hypothetical chemical reaction with “feedback,” i.e., reeycling 
of components, where the vertices of the graph would correspond to various 
chemical ingredients and where it is desired to know whether a certain 
product can be obtained by a process consisting of the same number of 
steps and starting at any of the vertices. Or, in mathematical economies 
if U is a primitive matrix, the exponent m corresponds to the mth round of 
transactions such that, if y(D) = m, all sectors in an economic input- 
output model are connected at the mth round and all subsequent rounds. 

The question now is: When is a nonnegative irreducible matrix prim- 
itive and what is its index of primitivity, or an estimate of it? We shall 
be briefly interested in the second part of the question. 

One of the oldest estimates on the index of primitivity, due to 
Lowenheim and to Wielandt, is that it cannot exceed (n — 1)* + 1, where 
n is the number of vertices and, of course, also the order of the matrix. 
It is a corollary of the following theorem: 


Theorem 5-8 If D is a primitive graph and V is its vertex matrix 
and if s is the length of the shortest simple cycle in D, then the index of 


primitivity of V satisfies 
(V) < n+ s(n — 2) 


Proof: According to Exercise 5-11, D* (the graph corresponding to V*) 
is primitive, and because D has a simple cycle of length s, D* has at least 
s vertices with loops. Hence there is an are progression of length 
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pi <n — 8 from any vertex v; to some vertex v, that has a loop in D*, 
Again according to Exercise 5-15, there is an are progression of D* 
of length n — 1 from » to any v;. Thus the are progression in D from 
vz tov, has length (n — 1)s, and that from 0; to v; has length p; + (n — 1)s. 
Hence hi < pi + (n — 1)s and 


¥y(D) = max[m,..., ml Sn—8s+(n—1)s = n+ a(n — 2) 


Corollary 5-9 In a primitive graph, the GCD of the lengths of all 
simple cycles is unity and hence s < n — 1, from which it follows that 


(V) Sn+(n—VYm—2) =(n- 1) +1 


Heap and Lynn" show that if the graph of U has at least k > 2 simple 
cycles whose lengths are distinct and relatively prime, then the index of 
primitivity does not exceed 


(n —1)?+1—4(k — 2)(Qn —k — 3) 


5-6 THE PATH MATRIX 


In a connected graph with indexed vertices, a path (properly chain) 
matrix P is obtained by taking for its rows P;, the paths from the first to 
the last vertex, and for its columns the edges of the graph. Hence the 
entries of the matrix are 1 or 0 depending on whether an edge belongs to a 
given path or not. For example, the graph of Fig. 5-8 has the matrix of 
paths between v; and »; given by 


€; €2 @3 C4 Cy 66 

Payee 0) 20! 50) a) 16 
Pera dO 0 31. O01 
Bae) Lei Ol. iow @ 
PON 1 fT tL 0 4 


Theorem 5-10 The product AP’ of the incidence matrix and the 
transpose of the path matrix is a matrix all of whose rows except the first 
and the last have zero elements. 

The exceptional two rows have ¢, v9 es 
unit elements everywhere. ¥j %% 

Proof: An element of AP’ is 
unity if and only if an edge is at the e fe eg 
same time in a given path and in- 
cident with the first or last vertex. 
There is exactly one such edge in 
any path between these two ver- Figure 5-8 
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tices. Nonterminal vertices along any path have degree 2 or 0, and 
hence the remaining elements are zero mod 2.*4 

Remark: The rank of the path matrix of a graph with n vertices and 
m edges ism — n + 2 — c, where c is the number of independent circuits 
in those separable subgraphs between the terminal vertices whose removal 
from the graph does not remove either of the terminal vertices. From 
Theorem 5-10 note that the rank of P does not exceed m — n+ 2. Note 
also that the chords of the trees in the subgraphs described above do not 
belong to any path. 


5-7 THE REALIZABILITY OF THE CIRCUIT AND 
CUT - SET MATRICES 


We have already examined the problem of constructing and characterizing 
matrices associated with a graph. The converse problem of constructing a graph 
corresponding to a given matrix is generally either trivial or very difficult. The 
first case is readily illustrated by a matrix with exactly two unit elements in every 
column and zeros elsewhere. A graph can always be constructed with the given 
matrix as the incidence matrix. Because an edge may be incident with more than 
two circuits or two cut-sets, the problem of constructing a graph for a matrix 
which is to serve as a circuit matrix is not so easy. 

The question of realizability has been examined by several people. An 
interesting exposition of the ideas is found in the paper by Ash and Kim.! A 
rigorous and deep theory leading to necessary and sufficient conditions for realiza- 
bility has been developed by Tutte.!71* We shall present the barest essentials 
for the main theorem. 

Consider the column vectors of a matrix. One can say that a subset of these 
columns is either linearly independent or linearly dependent. Thus the subsets 
fall into two classes which are not arbitrary since, for example, the following two 
theorems must hold.!* 


1. Any subset of an independent set is independent. 

2. If Np and N,y4: are independent sets of p and p + 1 columns, respectively, 
then N, together with some column of N41 forms an independent set of p + 1 
columns. 


A system obeying (1) and (2) is called a matroid. There are theorems not 
deducible from (1) and (2), that is, there are matroids with no corresponding 
matrices. Thus every matrix is a matroid, but not conversely. 

The above definition of a matroid is a special version of the following general 
definition. A matroid on a finite set M is a class M of nonnull subsets of M which 
satisfies these conditions: 


1. No member of M contains another as a proper subset. 

2. If X,YeM, ae X MY, and be X — (XY), then there exists Z eM 
such that be Z C(X UY) — {a}. 

The reader should have no difficulty in verifying that the set of all simple 
circuits or all proper cut-sets of a connected graph forms a matroid on the set of 
edges of the graph. We shall use 2 to denote the ring of integers or the ring of 
residues modulo a prime number. 
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A chain on a finite set M over R is a mapping f of M into R, that is, if ae M, 
then f(a) is the coefficient of ain R. One can define the sum f + g of two chains f 
and g, and the product kf, where k e R, by the relationships (f + g)(a) = f(a) + 
g(a) and (kf)(a) = kf(a), for ae M, and show that the chains.on M over R are 
the elements of an additive abelian group A(M,R). A chain group on M over 
R is any subgroup of A(M,R). 

A chain f of a chain group N is elementary if it is nonzero and there is no non- 
zero ge N such that |g| is a proper subset of |f|. Here the set of all ae M such 
that f(a) # 0 is the domain |f| of f. 

If R is the ring of integers, we define a primitive chain of N as an elementary 
chain f of N in which the coefficients f(a) are restricted to the values 0, 1, —1. 

A chain group N is called binary if R is the ring of residues mod 2. It is 
regular if R is the ring of integers and if to each elementary chain there corre- 
sponds a primitive chain with the same domain, 

If M(N) is the class of domains of elementary chains of N, then M(N) is a 
matro'don M. Itis called the matroidof N. A matroid is binary or regular if it 
is the matroid of a binary or regular chain group, respectively. 

Let K(G) and C(@) denote, respectively, the matroid of proper cut-sets and 
that of simple circuits of a graph G. A matroid M is graphic or cographic if there 
is a graph G such that M = K(G) or M = C(Q), respectively. 

If M is a matroid on a set Z, and if S C EZ, let M-S denote the set of 
minimal nonempty intersections with S of elements of M (where a minimal non- 
empty intersection is one not containing another). Then M - Sis a matroid on 8, 
ealled the reduction of M to S. (In a graph G, let G - S denote the subgraph of @ 
determined by the set S of edges. If M is the cut-set matroid of G, then M - Sis 
the cut-set matroid of G - S.) 

Similarly, the set of elements of M which are subsets of S is a matroid on S, 
denoted by M X Sand called the contraction of MtoS. (Ina graphG, letG x S 
denote the subgraph obtained by contracting to a point all the edges not in S. 
If M is the cut-set matroid of G, then M X S is the cut-set matroid of G X S. 
In circuits, the roles of ‘‘dot’’ and ‘‘cross’’ must be interchanged.) 

Finally, if M is a matroid on EZ, a matroid of the form (M - 8) X 7, where 
T CS CBE, is called a minor of M. 

Theorem 5-11 A matroid M is graphic (cographic) if and only if it is 
regular and has no minor which is the circuit-matroid (cut-set matroid) of a 
Kuratowski graph. 


5-8 RELATION TO COMBINATORIAL TOPOLOGY 


In this section we relate the foregoing concepts to certain notions which 
are basic to combinatorial topology. Some of the results discussed here 
are repetitions of material presented earlier, but they are shown in a 
different framework. Also included is the proof of a coloring theorem. 
A 1 chain on a directed graph G with n vertices »; and m edges & is a 

m 


linear combination of the edges, that is, yy, a,¢,, where a, are real, although 
kel 

one need not be restricted in this manner but can choose a, complex or 

just rational. All 1 chains define a vector space U; on G with the edges 

forming an m basis, where chain addition is defined by adding correspond- 
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ing coefficients. Similarly, it is possible to define a vector space Uo of 
0 chains by using linear combinations of the vertices with real bj, that is, 
n 


they are sums of the form y bv; We define the inner product of two 
i=1 

1 chains or two 0 chains as the sum of the products of corresponding 
coefficients. If this sum is zero, they are said to be orthogonal. 

We can introduce a boundary operator @ (and a coboundary operator 
6) which transform VU; into Us and Up into U;, respectively. If @ is defined 
on an edge e, then it can also be defined on a 1 chain by requiring it to be 
linear. Let ¢ be oriented from the end point p; to the end point ps; then 
we define 

de=pi-—p2 and 0) aer= Ya der 
Kat kat 

Note that p: and pz may coincide to form a single vertex of the graph. 
If v is a vertex of the graph, then the coboundary of v is defined by the 
“bundle at v”’ 


bv = Le — Le 


where the first sum is taken over the edges directed toward v and the 
second sum is taken over the edges directed away from». Similarly, the 
coboundary of a 0 chain is 


n n 
6 Sy bai = Yb; 60 
i=1 i=1 
Note that the boundary of a cycle or a linear combination of cycles is zero, 
The 1 chains haying zero boundary form a vector space which is a sub- 
space of U;. A cocycle isa 1 chain and is the coboundary of some 0 chain. 
The set of coboundaries is also a vector space which is a subspace of Uo. 
We have the following interesting theorems.® 


1. The inner product of the boundary of any 1 chain and any 0 chain 
is equal to the inner product of the 1 chain and the coboundary of the 
0 chain. 

2. The inner product of a cycle and a coboundary is zero. 

8. A 1 chain which is a cycle and a coboundary is the zero vector of 
U1 

4. A 1 chain which is orthogonal to every coboundary is a cycle. 

5. A 0 chain which is orthogonal to every cycle is a coboundary. 

6. A 1 chain has a unique expression as the sum of a cycle and a 
coboundary. 

7. The dimension of the cycle space is m — n + 1 and that of the 
coboundary space isn — 1. 
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8. Using the integers mod 2, a set of edges is a cut-set if and only if 
it is a coboundary. 


There is an interesting way to introduce incidence matrices by using 


the boundary operator 0, Consider the vector e = (1, . . . , @m) and 
define de = (de1, . . . , 0m); then consider the set of end points p:, po, 
+ » Pom Of 1, ... , &m and form the vector p = (pi,p2, . . . ,Pam)- 


Let D be a 2m X m matrix whose elements dj; are 0, 1, or —1 depending 
on whether end point 7; is or is not incident with edge e; and on whether 
it is an initial or a terminal point of ¢. Then de = pD. If we now 
associate with each end point p; the corresponding vertex v; and form the 
vector v = (vj, . . . Yn) and the m X 2m matrix D whose 7, 7 element is 
0 or 1 depending on whether end point p; is associated with vertex v; or 
not, we have p= vD. Thus de = vDD, where DD is the incidence 
matrix A of the graph. 


Exercise 5-16 Construct the matrices D and D for the graph of Fig. 5-3 and 
obtain the incidence matrix from the product DD. 


Theorem 5-12 A map on a sphere in which every vertex is of 
even degree can be colored with two colors. '* 

Lemma 5-13 A closed curve C which does not pass through the 
vertices of our map crosses edges of regions an even number of times. 

Proof of theorem: The lemma (proved below) implies the theorem; for 
if we use red and blue and start by coloring region 2, with red, the color 
of any other region R; is determined by counting the number of edges 
that are crossed by an arbitrary curve drawn from a point in R, to a point 
in R;. We choose red if the number is even and blue if it isodd. From 
the lemma, any two arbitrary curves from R, to R; yield the same color 
for R;. Inthis manner adjacent regions will have different colors, since a 
curve from R, to one of them yields one color and, when extended across 
a boundary edge, must yield the other color. 

Proof of lemma: We define f'(e;) = 0 if C crosses edge e; an even num- 
ber of times and f'(e;) = 1 if C crosses e; an odd number of times (zero is 
even). 

Since the total crossing by C of the boundary of each region is even 
(each time it crosses into the region it must also cross out), f! must be a 
cocycle and is therefore the boundary of some 0 cochain f*. Let 1, 

. , %: all be the vertices on each of which f° takes on a unit value. 
Because the total number of edges meeting these vertices is even (an edge 
incident with two of these vertices is counted twice), the total number of 
edges which meets just one (but not two) of these vertices iseven. These 
are the only edges for which df° = f! assigns a unit value, and hence f! 
assigns a unit value to an even number of edges. From the definition of 
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f' we have the fact that there are an even number of regional boundary 
edges which C crosses an odd number of times. 
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PART II 


APPLICATIONS 


Part II contains a variety of representative applications of graph 
theory. Chapter 6, designed to be a long chapter, contains numerous 
brief applications illustrating this variety. In it some examples 
are formulated and solved, some are just formulated, and still others 
are briefly described to give a flavor of the wide applicability of the 
subject but must be pursued in the relevant literature for greater 
familiarity. In contrast, Chap. 7 goes deeper into two particularly 
significant areas of application. Thus in Chap. 7 linear flows 
through networks are discussed. General techniques for maximiz- 
ing flow and minimizing cost are developed from a graph-theoreti¢ 
viewpoint, 
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A VARIETY OF 
INTERESTING APPLICATIONS 


6-1 INTRODUCTION 


To study graph theory without being aware of its applications is 
to miss a major part of its substance. With the intention of 
emphasizing the importance of this aspect, we have divided the 
book into two parts, the second of which is devoted to a brief 
presentation of some interesting applications. This chapter has 
the aim of presenting a variety of examples to alert the reader to 
the art of model formulation in science and engineering by using 
graphs. The next chapter, on network flows, has a more specific 
objective. 

The objective of this chapter is, briefly, to point out various 
ways in which graphs can be used to formulate and to solve prob- 
lems. After reading each section, the reader is urged to inquire 
into the underlying properties of the graph being used in that 
section. In certain cases, such as the problem of the colored 
cubes, the use of the simple idea of a graph, when properly chosen, 
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is powerful. For that problem one obtains a lucid explanation of the work- 
ings of a puzzle, sold under such names as Tantalizer, which if solved by 
trial and error is quite unwieldy. In other cases more sophisticated 
graph-theoretic properties are exemplified. Some of these properties 
have found their way into industrial and government operation where, 
for example, PERT is now a basic tool of projected planning for efficient 
operation to meet prescribed objectives. The utility of the simple con- 
cept of a graph emerges throughout. 

It is clear that we must be brief in the description of these applica- 
tions, because most of them can be so expanded that each can occupy a 
chapter by itself. Therefore, for some sections we have settled for a 
short summary of some of the ideas in the hope that the interested reader 
will pursue the subject in the references. We have attempted to group 
the sections under general headings where they seemed to us to have the 
greatest relevance, but this grouping is flexible and the reader may be 
able to find a better framework. 


APPLICATIONS TO ECONOMICS AND OPERATIONS RESEARCH 


6-2 APPLICATION TO ECONOMICS AND LOGISTICS 


One of the most significant large-scale models in mathematical economics 
and logistics is the input-output model associated with the name of 
W. Leontief. This model, which we now examine from a graph-theoretic 
standpoint, constitutes, in fact, a modern realization of the celebrated 
Tableau économique of 1758-1759 due to Francois Quesnay.”* 

In the input-output model a classification grid is imposed on the 
transactions of an economy in which there are m collections of entities 
(e.g., establishments, firms, households). Each collection is called an 
industry, sector, or activity for entities producing (or consuming) similar , 
goods and services. For some historical period and in some set of valua- 
tion conventions, let x; > 0 denote the value of purchases made by 
industry (or activity) 7 from industry (or activity) 7; moreover, let «;; be 
called the value of input to industry i from industry j (1,7 = 1, ... ,m). 
Thus 2, in effect, measures the flow of funds from ¢ to j in return for the 

m 


resource inputs which flow fromjto7. Let thesum »» xn > 0 be denoted 
nel 

by 2; and be designated as the value of output of industry i. Let bi; denote 

the normalized value of input 2i;/x; > 0, and let it be designated as the 

unit input to industry or activity i from industry or activity j (i, j = 1, 

+++, m). Let the following (balance) valuation conventions obtain: 


m m 
p> y= Py Tri 
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for alli = 1, . . . , m, that is, the ith row sum corresponds exactly to 
the ith column sum. 

Consider now the matrix B* = |\bj;|| (i, 7 = 1, ..., m), defined 
above, and any proper principal submatrix B of B* such that B = |\bi;||, 
i4,j=1,...,randr<m. Let the system «(J — B*) = 0, 6 being 
the null vector of order m, be called a closed input-output model. Let a 
system «(J — B) = w, where B is any proper principal submatrix of 
order r < m and w is nonnegative, be called an open-input-output model. 
A solution x of an open or a closed input-output model is called admissible 
if, and only if, the vector z is finite and nonnegative but not null. The 
nonnegative vector w of an open model is construed as a stipulated “bill 
of goods” or “bill of final demand” for those industries or activities which 
correspond to rows and columns which appear in B* but not in B. Admis- 
sible solutions x of input-output models (whether open or closed) are 
construed as activity level vectors which state the level (of value of out- 
put) at which each industry or activity is operated. When the inverse of 
the matrix (J — B) of an open input-output model exists, it may be shown 
to have the well-known power series form’® 


(II-B) = . B 
h=0 


We now proceed to give a graph-theoretic formulation of necessary 
and sufficient conditions for the existence of the inverse of the matrix 
(I — B) of an open input-output model. For this purpose, we introduce 
some additional graph-theoretic definitions.7* Let D be a directed graph 
and let H be a strongly connected subgraph in D. A strongly connected 
subgraph H in D will be said to be maximal in D if, and only if, every 
strongly connected subgraph of D is either a subgraph of H or contains no 
vertex in common with H. A strongly connected subgraph H in D is 
said to be closed in D if, and only if, H is maximal and every vertex of D 
accessible (by means of a directed path) from any vertex of H is contained 


in H. Let A = |la,|| be a nonnegative square matrix of order n, that is, 
a; > Ofori,j =1,...,n. The finite directed graph D(A) of A is defined 
to consist of n vertices ay, . . . , a, and the totality of arcs (c,,0:) such 


that the are (ai,a;) exists in D(A) if, and only if, a; >0 in A. The 
directed graphs D(B*) and D(B) associated with the closed and open 
input-output models introduced above may be shown to have significant 
technological and accounting-flow interpretations. 

A nonnegative square matrix A will be said to be (row) substochastic 


n 
if no row sum of A exceeds unity, that is, a; > 0, 7: = y) a;<1. If 
j=1 
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each row sum of A 1s exactly unity, the matrix is said to be stochastic. 
On the basis of the earlier discussion, it is evident that, in a closed input- 
output model x(J — A*) = 6, the matrix A* is stochastic, whereas, in 
an open input-output model a(J — A) = w, the matrix A is substochastie. 
The following theorem, which we do not prove here, is given in Ref. 75. 

Theorem 6-1 Let A be a substochastic matrix. The inverse 
(I — A)~* exists if and only if there are no closed strongly connected sub- 
graphs in the directed graph D(A) or if, for each strongly connected sub- 
graph H closed in D(A) there exists a vertex of H with associated row sum 
in A less than unity. 

This result may be reformulated in the following fashion: 

Theorem 6-2 Let A* be a stochastic matrix and let A be any proper 
principal submatrix of A*. The inverse (I — A)-? exists if and only if 
no strongly connected subgraph which is closed in D(A) is also closed in 
D(A*). 

Proof: Assume H is a strongly connected subgraph of D(A). Then 
H is closed in D(A) with corresponding matrix in A stochastic if and only 
if H is closed in D(A*). By Theorem 6-1 no closed, strongly connected 
subgraph in D(A) is closed in D(A*). 

Theorem 6-2 has the following immediate corollary: 

Corollary 6-3 Let A*bea stochastic mattix. If D(A*) isastrongly 
connected graph (that is, A* is irreducible or indecomposable), then 
(I — A)~ exists for any proper principal submatrix A of A*. 


Exercise 6-1 Let A be a nonnegative square matrix of order nin which each row 
sum is positive. Show that, in the directed graph D(A) of A, (a) there exists at least 
one closed strongly connected subgraph and (b) that every vertex of D(A) is connected 
to one or more closed strongly connected subgraphs in D(A). 


Exercise 6-2 Show by the use of a “bordering” approach that every open input- 
output model which exhibits at least one admissible solution is equivalent to a “con- 
strained’’ closed input-output model. 


Exercise 6-3 Let A be a nonnegative square matrix of order n. Let x be a 
positive (x X 1) column vector and let y be a positive (1 X n) row vector. Let A 
Az 

oll 
D(A) is strongly connected. Show that, if A in A is distinct from the null matrix, A is 
primitive and a sharp upper bound for the index of primitivity (A) is 4. 


denote the following bordered matrix of order n + 1: The directed graph 


An alternative formulation of an economic “input-output” model, 
given below, is based on a theorem that is somewhat difficult to prove in 
limited space. Consider the following matrix whose coefficients give the 
per unit purchase figures (negative) of material by an industry from other 
industries and the per unit figures for output (positive) on the main 
diagonal, ®:*# 
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v1 v2 V3 


vy 6 —4 —2\0 
vo{ -3 5 0}2 
v\-1 =1 3/7 1 


The determinant of this matrix has the value 38. 

Consider the graph associated with this “interindustry” output and 
purchase model. We introduce a general consumption vertex C from 
which there is flow (consumption) toward each industry whose amount is 
indicated by the net output from that industry. These flows are given 
by the row sums of the matrix. The flow to »; is zero, and hence we need 
not draw that edge of the graph. We then form all possible trees of the 
graph (Fig. 6-1) and take the product over the edge capacities. The sum 
of these values taken over all the trees equals the determinant of the above 
matrix. We have 


2+16+0+24+84+04+8+2=38 


Positive trees correspond to matrices with row sums >0 and non- 
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Figure 6-1 
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positive off-diagonal elements. Thus a single positive tree ensures a 
positive determinant. Such matrices (e.g., Leontief matrices) with 
dominant diagonal are of great interest in mathematical economics. 


6-3 LINEAR PROGRAMMING AND NETWORK FLOWS 


Although network flows are not considered in detail until Chap. 7, we 
shall now give an informal explanation of the basic concept in order to 
exhibit one relationship of flow problems with linear programming 
problems. 

Let vertices vy and v, of a directed graph be designated as the origin 
and destination of a conceptual substance flowing through the ares. 
Assume, moreover, that an arc from vertex »; to vertex »; has an asso- 
ciated capacity, or upper limit on flow, ¢;. Finally, let C,; be the cost per 
unit of flow in the are. Our flow problem becomes a linear programming 


problem requiring the minimization of y Cia, for a total flow ¢ from vp to 
42 


v, Subject to 
»y (oj — 20) = € 
7 
Y Gs — 2x) = 0 fort¢=1,...,n—1 
a 


Y Geni — tin) = —e 
=) 
0< x; < for every arc 


The network flow is sometimes a convenient method for solving this 
type of linear programming problem, which is known as a transportation 
problem when it arises in other considerations. 


6-4 PERT AND RELATED TECHNIQUES 


A directed graph is a natural tool to employ in connection with describing 
and analyzing complex projects composed of numerous interrelated activ- 
ities. The overall project might be, for example, the total process of 
designing, constructing, and testing a piece of equipment or the process 
of designing and erecting a building, including related considerations such 
as site acquisition and site preparation. In general, suppose that we are 
considering some well-defined total project and that we can decompose 
the set of all actions associated with this project into individual nonover- 
lapping activities ay, a2, . . . , dn. There are, of course, various ways to 
break a project down into component parts. Determination of an appro- 
priate breakdown depends on considerations which will be evident later 
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Figure 6-2 


in this discussion. (In general, the individual activities must be such that 
it is possible to acquire the types of quantitative information discussed 
below and such that all essential precedence relationships, discussed next, 
can be determined.) 

While certain activities are independent of one another, there will in 
general be certain essential dependencies, with respect to time, which 
take this form: Activity a; must be completed before activity a; can be 
initiated. If all such time dependencies are given, they can be con- 
veniently summarized by a directed graph such as that of Fig. 6-2. Each 
are represents one activity, and each vertex is called an event and represents 
apointintime. In particular, vertex v: symbolizes the start of the project 
as a whole, and vs represents its completion. The intermediate vertices 
serve as the vehicle for indicating the activity precedences. They are 
chosen, and related to the arcs, in such a way that the following funda- 
mental assertion is valid for every activity (arc) and event (vertex): 

If activity a has initial event or vertex v, then a cannot be initiated 
until every activity which terminates at v has been completed. However, 
a may be initiated any time thereafter. For example, in Fig. 6-2, activity 
10 (as well as 12) can be started only after both activity 5 and activity 7 
have been completed. Indirectly, activity 10 also depends on the com- 
pletion of activities 1, 2, and 4, since 5 and 7 are directly dependent on 
them. The only activities which can be initiated at the outset are 1, 2, 
and 3, and the project is considered to be completed only when 11, 12, 
and 13 (and consequently all activities) have been completed. 

The management techniques known as PERT (program evaluation 
and review technique) and CPM (critical path method), as well as later 
related methodologies, employ such an activity graph as the fundamental 
structure on which analyses are based. To illustrate the most basic type 
of analysis, suppose that each activity a consumes a known amount of 
time ¢(a). A group of several activities may, of course, be performed 
concurrently if no activity in the group is constrained, either directly or 
indirectly, to start only after another has been completed. (This will be 
the case if no one activity can be included in a path which leads from 0; to 
the initial event of another.) Suppose, specifically, that the quantities 
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Figure 6-3 


associated with arcs in Fig. 6-3 represent activity durations, (For the 
moment, we are assuming that the duration of each activity is known and 
constant. In practice, the duration is frequently assumed to vary and to 
be governed by a probability distribution whose general form is known 
and whose parameters can be estimated.) 

The length (i.e., sum of time durations) of any path from »; to »; 
clearly represents a lower bound on the elapsed time, measured from the 
start of the project, before event v; “occurs” or is “realized” and activities 
having »; as initial vertex may be started. This being the case, it is con- 
venient to associate numbers (times) with vertices as follows: 


T(v1) =0 
T(v;) = max {t(P)} fort #1 


where ¢(P) denotes the time-length of path P and where the maximum is 
taken over all paths from v; to 1. 

Note that, by its nature, an activity graph is acyclic. (The presence 
of a cycle would constitute an impossible situation in which no activity of 
the cycle could be initiated first; for each activity would depend on the 
completion of another.) This being the case, we can use the method of 
Chap. 3 to find a spanning tree, rooted at v;, which simultaneously realizes 
a longest path from v; to each other vertex. (We are assuming that every 
vertex of the graph is reachable via at least one path.) Figure 6-4 shows 
an appropriate tree, given the activity durations shown in Fig. 6-3. The 
corresponding values of 7'(v;) are also shown. 


Figure 6-4 
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As mentioned above, the earliest possible time at which an activity 
(v;,v;) can be initiated is at least 7(v;) time units from the start of the 
project. On the other hand, a schedule based on the 7'(v;)’s is in fact 
feasible. More precisely, if we schedule each activity (v,,r,) to start at 
time 7'(v,) and be completed at time 7'(v;) + ti (where t; is the time asso- 
ciated with the activity), then no activity will be begun earlier than the 
ground rules permit and the project as a whole will be completed in 
T(vs) = 13 units, which is the earliest possible completion time. 


Exercise 6-4 Prove that if activities are scheduled in this manner, they will not 
violate any of the precedence relationships. 


A (timewise) longest path from the starting event v; to the ending 
event v, is called a critical path. Its length (in our example, 13) is the 
shortest time in which the entire project can be executed. Moreover, 
this minimal time will be realized only if each activity of the critical path 
is initiated as soon as its immediate predecessor is completed. In general, 
the critical path is not unique. (The reader may note an alternative 
critical path in our example.) An activity is called a critical activity if 
it is contained in one or more critical paths. Assuming that it is desirable 
to complete the project in minimal time, there is still a certain amount of 
latitude in scheduling noncritical activities. Measurement of this latitude 
is known as the determination of slack in the activity graph. This is our 
next consideration. 

Note that each event must be realized (i.e., all activities leading into 
it must be completed) early enough to execute, successively, all activities 
in any path from that event to the end event. This suggests associating 
a second set of numbers (times) with the events, comparable to the T(v;)’s 
but measured with respect to the end event rather than the starting event. 
Specifically, we define 


X(vn) = 0 
X(v;) = max {t(P)} fori An 


where ¢(P) is the length of a path from »; to », and the maximum is taken 
over all such paths. Note that we can again use the method of Chap. 3 to 
determine X(v,;). All we need do is temporarily reverse the direction of 
each activity, find a longest spanning tree rooted at v,, and then restore 
the original are directions. Figure 6-5 shows an appropriate tree for our 
example, as well as the values of X(v;). 

Since the values of X(v;) are measured from the end of the project, 
which occurs 7'(v,) time units after the start, it is convenient to relate 
these times to the start by defining the following quantities: 


Lv) = T(0n) — Xs) 
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Figure 6-5 


Then L(v;) denotes the latest time at which event v; can be realized, with- 
out lengthening the project as a whole. Figure 6-6 shows the value of 
L(v,) for each vertex, alongside the value of 7(v;) determined earlier. 
The values of 7'(v;) and L(v;) always satisfy the relationship 


TQ) < LO) 


Exercise 6-5 Prove the foregoing assertion, and prove moreover that T(v;) = 
L(v;) if and only if some critical path is incident with »;. 


The values of T(v;) and L(v,;) serve as the basis for determining the 
amount of freedom available in scheduling individual activities (without 
sacrificing minimal project time). For example, the activity from v2 
to vs (see Fig. 6-3) requires one unit of time, and we have determined that 


T(v2) =3 L(v2) = 3 T(v3) = 4 Lis) =7 


We could initiate this activity as early as time 3, or as late as time 6, and 
still “realize” event v3; by time 7. Naturally, if we “use” some of the 
available slack by scheduling this activity to start later than time 3, we 
diminish the latitude available for scheduling subsequent activities, in 
this case activities (vs,v7) and (v7,vs). In general, each event v will be 
realized at a time between 7'(v) and L(v), depending on how slack is 
distributed. 

So far we have assumed that each activity invariably consumes a fixed 
amount of time and that this amount is known precisely. If this is not 
the case (and it never is), one can often reasonably postulate that the 


55 10,10 


4,7 8,11 


Figure 6-6 
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duration of each activity can be considered as a random sample from a 
probability distribution associated with the activity. One then attempts 
to obtain the best possible estimates of the parameters of this distribution 
and bases subsequent analyses on them. In the original PERT tech- 
nique, for example, it was assumed that the activity duration was drawn 
from a distribution known as a “beta distribution.” (In the present 
context, the precise nature of a beta distribution is immaterial. The 
interested reader may follow this up by referring to documents describing 
PERT.) For each activity, one then obtained from appropriate knowl- 
edgeable individuals three time estimates: a, m, and b. Here a and b 
are the estimated shortest and longest times which the activity might 
reasonably take and m is the time considered most likely. From these 
the mean time 7 and standard deviation o are estimated by means of the 
formulas 


i= 


(a + 4m + b) 


(b — a) 
5 - 


6 


With such estimates, one can first of all determine 7'(v,), L(v;), and 
slacks, based on mean values, in the manner described above. In addi- 
tion, the variability of the activity durations can be used to estimate 
such project characteristics as the probability that the project can be 
completed in a specified total time. 

Even if the distribution associated with each activity is considered 
to be known quite precisely and is assumed to be independent of the 
distributions of other activities, considerable mathematical complication 
arises and one must resort to approximation techniques. To illustrate 
just one complication in a general way, suppose that path P is a critical 
path and P’ is a slightly shorter path from the starting event to the end 
event, based on mean or expected activity times. It may happen that 
there is little time variation associated with the activities along P, but a 
great deal along P’. The distributions associated with P and P’ might, 
for example, be as in Fig. 6-7. In this case there is a substantial prob- 
ability that P’ will in fact be a greater limiting factor than P, so that 
conclusions based on P as “the” critical path may be rather inaccurate. 


ie 
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Figure 6-7 
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In addition to considering project duration, as it is related to activ- 
ity durations, it is often important to consider additional (or alternative) 
quantitative characteristics such as the expenditure of manpower and 
money. Moreover, these characteristics may interact. For example, 
in some cases one can shorten activity duration by expending more 
money or allocating additional manpower. A considerable amount of 
work has been done, and continues to be done, toward solving the variety 
of problems which result when the objective (i.e., the measure one seeks 
to optimize) and/or the constraining conditions are varied, or when 
one’s relationship with the project is changed. (For example, if one must 
irrevocably make all decisions, with respect to allocation of effort to 
activities, prior to initiation of the project, one approach is required. 
On the other hand, if one can allocate or reallocate effort during the 
project, based on performance to date, quite another methodology is 
appropriate.) Many of these techniques are effectively implemented 
through the use of digital computers. It has been our objective here to 
emphasize the basic role played by the activity graph underlying such 
methodologies. For basic references see papers by Kelley," Kelley and 
Walker,®! and Malcolm et al.*! For an interesting discussion of the 
underlying assumptions see the paper by MacCrimmon and Ryavec." 


Addendum 


PERT illustrates that graph theory is a powerful tool for solving a project 
planning and scheduling problem. From a graph-theoretie point of view PERT 
is concerned with a time relation defined on a graph. This time relation permits 
the determination of time schedules, event time distributions, and a longest-time 
tree (critical path). 

The success of PERT stimulated the application of graph theory to other 
project management problems (Bigelow, 1962). Since original PERT was con- 
cerned with a time relation defined on a graph, it is not surprising that other rela- 
tions such as cost and resource relations were also defined on graphs. Here 
resources are men, materials, and facilities. In addition to numbers, one can 
associate with each are of a graph such functions as time-cost and time-resource 
trade-off. These functions are interpreted to indicate how the cost or resource 
usage of an activity varies with activity duration. Defining cost functions for 
each are of a project graph permits the computation of cost-time trade-off curves 
for projects. A number of algorithms have been developed to compute trade-off 
curves between project duration and project cost. These algorithms can be 
employed to determine project schedules which minimize project cost for a given 
project duration. 

The project cost curve algorithms of Kelley (1961), Fulkerson (1961), and 
Grossman and Lerchs (1961) illustrate the use of graph theory in model formula- 
tion. computational algorithms, and development of simple proofs. The difficulty 
of the above three papers is inversely proportional to the amount of graph theory 
they contain. Kelley's proofs, based on a parametric linear programming method 
of Gass and Saaty (1955), are forbidding. Fulkerson’s method of translating the 
original parametri¢ linear programming problem into a network flow problem is 
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simpler than Kelley’s approach, and the completely graph-theoretic approach of 
Grossman and Lerchs is nearly intuitively obvious, but nevertheless rigorous. 
An approach similar to that of Grossman and Lerchs has been applied by Berman 
(1964) to nonlinear cost functions and by Fey (1964) to multiple project scheduling. 

Numerous attempts have been made to deal with resource functions defined 
on graphs (Lambourn, 1963; Fey, 1964). Here the typical problem is to allocate 
resources in such a way that schedules will be met and that the amount of resources 
needed by projects will not exceed the amounts available in any one time period. 
There remain a number of open problems in this area of resource allocation. 

Before leaving PERT, we should point out that its basic assumptions have 
been questioned (MacCrimmon, 1964). One of the questions arises from the 
fact that activity durations are sometimes given in terms of a probability distribu- 
tion and not as real numbers. To face some of the difficulties created by stochastic 
variables, Fey (1963) and Van Slyke (1963) have implemented a network simula- 
tion approach. 
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COMBINATORIAL PROBLEMS 


6-5 EXAMPLES OF COMBINATORIAL PROBLEMS IN 
GRAPH THEORY 


In this section we shall briefly allude to computational problems that 
arise in the subject. Because the methods used can be intricate and 
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outside the scope of the book, we shall be content to deal with one or 
two interesting problems. 

In counting problems one must distinguish between a labeled and an 
unlabeled or free (topological) graph. Two graphs whose vertices are 
labeled are the same if and only if for all 7 and j the same number of 
edges are incident with the vertices labeled i and j in both graphs. 
Thus two graphs may be considered distinct even though they may be 
isomorphic. 

One may reverse the process by considering graphs with a given 
number of unlabeled vertices and a given number of labeled edges. 
Because labeling distinguishes among graphs in spite of their topological 
equivalence, it is easier to carry out computations for labeled graphs; 
for one does not have to determine how many graphs are equivalent and 
thus reduce the total number. How many graphs have a certain prop- 
erty, e.g., cycles of length 3, is one question that occurs frequently in 
such investigations. 

For example, the number of labeled graphs (which may be discon- 
nected) with n labeled vertices and k unlabeled edges in which a pair 


of vertices is incident with at most one edge is ee im Y/ ) Thus 


we take the edges of the complete graph on n vertices k at a time to obtain 
the desired number. If we take 4 vertices and 4 edges, there are 15 
possible labelings on two topologically different graphs. In fact, Fig. 
6-8 gives the number of labelings for topologically unequivalent graphs 
of four vertices with respectively three, four, five, and six edges.** 
Many counting problems in graph theory are abstractions of physi- 
cal problems (especially from statistical mechanics), and their graph- 
theoretic formulation clarifies the counting process. Some of these con- 
cepts are related to special kinds of trees. A graph without articulation 
points is called a star, and hence a connected graph may be regarded as a 
collection of stars which meet at articulation points. The usual defini- 
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tion of a tree indicates that a tree is a graph with articulation points 
whose constituent stars are single edges. If the constituent stars are 
polygons, the graph is known as a Husimi tree. Figure 6-9 becomes a 
Husimi tree if the two articulation points are joined by a second chain. 

If the constituent stars are more general, the connected graph is 
known as a star tree. If all the stars are the same, it is a pure star tree; 
otherwise, it is called a mixed star tree. When the types of stars are not 
prescribed, we simply have a connected graph. A tree whose edges are 
designated as either positive or negative is known as a signed tree. 

Many combinatorial problems relating to graph theory lead to inter- 
esting formulas. For example, one can show by elaborate means that 
the number of graphs with ” labeled vertices consisting of k disjoint trees 


is given by” 
k 
J 1\) (k -1 ; 
BD, (-5) G)Q45- 1) era! 


j=0 


In the process of the derivation of this quantity it is possible to determine 
the number of trees of the complete graph with n labeled vertices. This 
number was first proved by Cayley to be n"-*. We now give an interest- 
ing inductive proof of this fact.” 

Theorem 6-4 The number of spanning trees of the complete graph 
with n labeled vertices is n*~*. 

Proof: To avoid unnecessary manipulations, let us first point out 
that the following identity is known in analysis: 


a—-1 


MS (") gn — ft = 2n-*(n — 1) 


j=) 


The theorem that a complete graph has n"~* trees is clearly true for 
a graph with a single vertex as 1}? = 1. Let us now assume the theorem 
to be true for the number of trees in a graph with vertices <n — 1 and 
prove that it is true for one with n vertices. Let us denote the number 
of trees of this complete graph by 7, We then divide the n vertices into 
two sets, one with 7 elements and the other with n — 7 elements, where 7 
can assume any of the values 1, 2,...,%-—1. By the inductive 
hypothesis the number of trees of the first subgraph is 7'-* and that of 
the second is (n — 7)"-'-*. Now we examine all ways in which a tree of 
the first subgraph can be combined with a tree of the second in order to 
form a tree for the entire graph. Since the connection can be made from 
any of the 7 vertices of the first subgraph to any vertex among n — 7 in 
the second, the total number is i(n — 7). Thus the number of trees 
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of the entire graph obtained from the partition for one value of 7 is 
a(n aad aii-2(n rot aj = Un _ ) hea 


However, the 7 vertices can be chosen from among the n vertices in 


n! a (™ aes 
@—om~ \i) Wy 


and hence if we multiply the above number from a single partition by this 
quantity and sum over 7, we obtain 


n—-1 
by (:) in — iro 


t=1 


There remains the question of duplications. Some trees of the origi- 
nal graph can be obtained in this last sum in more than one way. In 
fact, since there are n — 1 ways to choose the subgraph with @ vertices, 
as the values of 7 increase toward n — 1 those of n — i decrease toward 1, 
thus interchanging the role of the subgraph with 7 vertices with that 
which has n — 7 vertices, and vice versa. Thus there are n — 1 pairs 
of subgraphs with (1, n—1), (2, n-—2),..., @ n—@,..., 
(n — 1, 1) vertices, each pair giving rise to two trees of the original graph, 
since for example, (1, n — 1) can be taken in two ways. Thus the last 
sum must be divided by 2(m — 1) to yield the total number of distinct 
trees of the original graph. By using our above-cited result from analy- 
sis, we obtain n”~* for the number of trees. 


Exercise 6-6 Verify the factor of 2(m — 1) duplications on the complete graph 
with four vertices. Note, for example, that the tree (a) shown below arises in 
six ways. Two are accounted for by connecting the subgraphs as indicated in (6), 
where in one case the isolated vertex is the first partition and (c) is the second, and 
where in the other case (d) comprises the first partition and the isolated vertex com- 
prises the second. This accounts for two ways. Two more ways are obtained 
from (e). The final two duplications are obtained from (f). 


fa) (6) (¢) (d) 


We now offer a proof due to Trent.’ Let A be the incidence matrix 
(less 1 row) of the complete graph on n vertices, with n — 1 independent 
rows. It is known that the number of distinct trees in any graph is given 
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by the determinant of AA’, which we denote by |AA’|. Here AA’ takes 
the form 


Let 7 be a second matrix of order n — 1 whose elements are given by 


ty = 2 t<n-1 
tj =1 t#j 
tana. = 1 


It is easy to show that |7| = 1. 
Now consider the determinant of the product 


n 0 0.0 
Oe a =O 
|\T(AA)| = |T||AA’| = det -- 2200s ee =n 
Oe Oe ea? one 
ton Viel 


Since |7| = 1, it follows that |AA’| = n*-*. This completes the proof. 

As another illustration let us compute the maximum number of cycles 
having three arcs that a complete antisymmetric graph of n vertices 
can have (Ref. 5 of Chap. 1). Consider the vertex matrix associated 
with this graph. The 7th row of the matrix gives the incidence relation 
for ares positively incident with the7th vertex, whereas the 7th column gives 
the incidence for arcs negatively incident with this vertex. If r; denotes 
the sum of elements in the ith row and ¢; denotes the corresponding sum 
in the ith column, then 7; + ¢; = n — 1, since the 7th vertex is connected 
by n — 1 edges to the remaining n — 1 vertices. 
ah However, 
this is not the number of cycles. A cycle must have all edges cyclically 
directed. Thus if two ares are positively incident with a vertex, they 
cannot be two sides of a cycle, because their orientations are opposite 
for that cycle. 

Since the sum r; of the elements of the ith row gives the number of 


ares directed from the ith vertex, we must exclude from the total num- 
n 


ber of circuits the quantity ) : 
i=l 


The total number of circuits with three edges is( 


2 A that is, the sum of the 7th row taken 


two at a time and summed over all rows. This gives for the number 
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@)-2@)-G)-a2¢-9 


Since our graph is complete, the number of its edges is (3) and we must 


of cycles 


a because the total sum of all rows must account for 


all the edges of the graph. 
We now have for the number of cycles 


(5) +26) -ad" 


and the problem is to determine r; so that this quantity is maximum. 
The choice of r; corresponds to a complete graph with special orientation 
that maximizes the number of cycles. It is sufficient to determine r; 


have y = ( 
1 


i= 


n 
so that y r? is minimum, because this quantity is subtracted from a 
i=l 

constant amount in the above expression which is to be maximized. The 
foregoing argument would also be valid if we used the sum ¢; of the column 
elements and the fact that two arcs having the same orientation toward 
a vertex with which they are incident cannot be two sides of a cycle. 
We would then have to find ¢; which maximizes 


(+20) 226 


Thus we would have the maximum number of cycles if ¢; which minimizes 
n 


i= 
is found. 

Thus the maximum number of cycles is symmetric in r; and ¢;, that is, 
they must be equal. Since r; + ¢; = n — 1, we have, when n is odd, 
ri = (n — 1)/2. 

Exercise 6-7 Substitute this value of 7; and obtain an explicit expression for the 


maximum number of cycles. Also, find 7; when is even and complete the computa- 
tions for this case. 


Remark: Another interesting question is to derive a formula for the 
average number of end points of a tree given at random. Frequently, 
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the computations do not lead to a convenient formula for the result. 
However, by assuming large values for n, one can obtain an asymptotic 
formula which is easier for computation. For example, the mean number 
of end points of a tree taken at random from among the trees whose 
number was computed above is n/e, where e is the base of the natural 
logarithm, that is, e = 2.718 - +--+. In fact, one can also compute a 
probability distribution for this mean number. 

Formulas have been derived for the number of rooted graphs, i.e., 
graphs in which a vertex called a root is singled out, and others for com- 
puting rooted star trees. Still other results have been concerned with 
digraphs (directed graphs) and with graphs having strength k; that is, 
edges of multiplicity up to k can be incident with a pair of vertices. 


In a complete graph with n labeled vertices there are (3) edges. 
The number of graphs with N edges is obtained from 


that is, by taking the e edges N at a time. Suppose N edges are 


n 
2 
ability that the resulting graph is connected? The graph may consist 
of several components; what is the size of the largest tree, i.e., how many 
edges does it have? Note that here two points are allowed to be incident 
with only one edge. However, one may allow multiple (i.e., parallel) 
edges and ask similar questions. 

It can be shown that for large N, the total number of connected 


chosen at random from among ihe ( ) original ones; what is the prob- 


n n 
graphs is o(2) in the labeled case and 2(3) /n\ in the unlabeled case. 
A birth process or a stationary branching process, also called a 
multiplicative process, can be represented by a rooted tree, and relevant 
questions can be studied and answered for the associated tree.°* Here 
we have an individual wp (corresponding to the root of the tree) who 
gives birth to w individuals w, we, ... , where w = j with probability 
Pj; w in turn gives birth to wj1, we, ..., and soon. The births are 
mutually independent, and all individuals have the same probability 
distribution of offspring. The probability P,, that the tree is finite with 
n vertices has theasymptotic form P, ~ A-*n— for large n(n = 1 mod q), 
where A is a constant and q is the largest integer such that if p,; ¥ 0, 
then j is a multiple of g. P, = 0 for other values of n, One can also 
obtain asymptotic expressions for the probability that w = 7 and for the 
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distribution of the fraction of vertices with k outgoing segments in trees 
which are finite and which have n vertices. 


Application of Polya’s Theorem to Enumeration Problems 


Several basic enumeration problems in graph theory can be resolved 
by appropriate applications of a fundamental combinatorial theorem 
due to Polya.™! Among these are the problems of counting the number 
of nonisomorphic simple graphs having p vertices and q edges, or the 
number of nonisomorphie simple directed graphs having p vertices and 
q ares, and generalizations of these problems when the graphs are not 
necessarily simple (but when the maximum number of parallel edges or 
strictly parallel ares is bounded). 

The solution of these and of related counting problems by the use 
of Polya’s theorem is developed fully by Harary.** In order to illustrate 
the basic method in several simple settings, we shall consider the follow- 
ing specific problems: 


1. Find the number of nonisomorphic simple graphs having five 
vertices and q edges, for all q. 

2. Find the number of nonisomorphic simple directed graphs having 
four vertices and q arcs, for all g. 

3. Find the number of nonisomorphic graphs having four vertices 
and q edges, for all g, where at most two edges join any pair of vertices 
and where loops are prohibited, 


Before introducing Polya’s theorem and its application to these 
problems, some preliminaries are necessary. The first of these deals 
with groups of permutations: A permutation of degree k is an operator 
which, when applied to any ordered set of k elements, rearranges them 
into a new order. (If each element retains its original position, the 
permutation is called the identity permutation.) Since the nature of the 
objects being permuted is immaterial, a permutation of degree k can be 
characterized in terms of the integers 1 to k, which represent the posi- 
tions of the elements in the ordering. For example, the scheme 


Old position: 1 2 3 4 5 6 


- New position:3 2 5 6 1 4 


characterizes the permutation of degree 
6 in which the first element becomes 
third, the second remains second, the 
third becomes fifth, ete. 

5 3 The preceding permutation is also 
represented, in an obvious way, by the 
directed graph shown in Fig. 6-10. 
Figure 6-10 More generally, any permutation 


4 
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of degree k can be represented by an appropriate directed graph whose 
vertices correspond to the integers 1 to k and which is such that the 
positive and negative degree of every vertex is 1. We have seen that 
such a directed graph necessarily decomposes into one or more vertex- 
disjoint simple cycles (some of which may be loops). In fact, still 
another useful notation for the above permutation is the so-called cyclic 
representation 
(1,3,5) (2) (4,6) 


In general, the cyclic representation is interpreted as follows: The posi- 
tion represented by any integer maps into the position corresponding 
to the next integer on the right, except that the rightmost position within 
a grouping maps into the leftmost. 
Given a permutation of degree k, the type of the permutation is 
specified by stating how many cycles of length 7 it contains for 7 = 1, 2, 
. , k. If mn; denotes the number of cycles of length 7, then the per- 
mutation’s type can be conveniently expressed as the vector (1,n2, 
. 5m). Clearly, the type must satisfy 


l-m+2-m+°° > them =k 
(Why?) The specific permutation described above is of type 
(1,1,1,0,0,0) 
while the permutation of degree 12 whose cyclic representation is 


(1,4,2,6)(3) (5,7,9,8)(10) (11,12) 
is of type 
(2,1,0,2,0,0,0,0,0,0,0,0) 


Another convenient way to represent the type of this permutation is 
by the expression 
yr’ya'ys? 
in which subseripts denote cycle lengths and exponents denote corre- 
sponding numbers of cycles. (The symbol y has no significance except 
as a base symbol to which one can attach subscripts and exponents.) 
Note that if no cycles of length 7 occur, the factor y;° may be omitted. 
Given a set P consisting of k permutations of degree h, let 


Nisin yen Jk 


denote the number of permutations of type (ji,j2,... ,Jx). Then 
the formal series 


1 sith 
Z(P) = iD Vind. YY > > + Yate 
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where the summation is over all meaningful types, is called the cycle 
index of P. 

We shall be concerned with sets P of permutations (of the same 
degree) which form a group relative to the binary operation of applying 
two permutations successively. Thus P must contain the identity 
permutation, the inverse (or reverse) of each of its permutations, and 
the “product” (i.e., resultant) of any two of its permutations. 

The permutations of interest in the present discussion are the per- 
mutations of the set of all unordered pairs (or, in the case of a directed 
graph, ordered pairs) of vertices of a graph which result when the vertices 
themselves are permuted. For example, if the vertices of a four-vertex 
graph are permuted as shown in Fig. 6-1la, then the unordered pairs 
of vertices are permuted as shown in Fig. 6-11b. In this particular exam- 
ple, note that a permutation of the four vertices whose type is yi'ys! 
induces a permutation of the six unordered pairs of vertices whose type 
is y;*. Similarly, each of the n! possible permutations of n vertices 
induces some definite permutation of the n(m — 1)/2 unordered pairs 
of vertices [or of the n(n — 1) ordered pairs if we wish to study directed 
graphs]. 

In what follows, we shall need to know the number of permutations 
of each type induced by all possible permutations of four vertices, for 
both unordered and ordered pairs, and by all possible permutations of 
five vertices, for the unordered case only. A discussion of how this 
information may be obtained in general is contained in Ref. 44. For 
the cases of interest, this information is summarized by the following 
eycle indices: 


Four vertices, unordered pairs: 


Pa(yi® + Yys?y2? + Sys? + 6y2'y4?) 
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Four vertices, ordered pairs: 
ze(yr + 6yr2y25 + 3y2° + 8yst + 6ys') 
Five vertices, unordered pairs: 


rao (yil? + LOyrtys® + 20yr ys’ + 80ye2y2! + L5ystys? 
+ 20yr'ya'ye* + 24y5*) 


As a further preliminary to describing Polya’s theorem, we introduce 
several auxiliary notions. Consider a set of abstract objects called 
figures and assume that one of several nonnegative integers (we shall 
use only 0, 1, and 2) is associated with a figure and is referred to as its 
content. (In a more general setting, Polya’s theorem permits one to 
associate a vector of integers with a figure.) If a, denotes the number 
of different figures having content k, then the formal series 

A(z) = ) aot 
k=0 
is called the figure-counting series. (Here x is a dummy variable.) 

A configuration of length s is a sequence or ordered set of s figures. 
The content of a configuration is taken to be simply the sum of the con- 
tents of its figures. Certain configurations of length s are regarded as 
being equivalent to one another. Specifically, let P be a group of per- 
mutations of degree s, and let h denote the number of permutations in 
the group. Then two configurations are said to be equivalent configura- 
tions relative to P if and only if one is transformed into the other by an 
appropriate permutation in P. 

If b, denotes the number of inequivalent configurations (of length s) 
having content k, then the formal series 


B(a) = . bya® 
k=0 


is called the configuration-counting series (relative to P). 

Polya’s theorem enables one to determine B(x) from the figure- 
counting series A (a) and the cycle index Z(P) of the permutation group P. 
Specifically, we have (without proof): 

Theorem6-5 (Polya.) If A(x) andZ(P) denote the figure-counting 
series and the cycle index of P, respectively, then the configuration- 
counting series B(x) may be found by substituting A(z*) for each occur- 
rence of y; in the cycle index of P. 

Now consider once again the problem of counting all nonisomorphic 
simple graphs having five vertices. Take as figures the 10 unordered 
pairs of distinct vertices. Adopt the convention that a figure has con- 
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Figure 6-12 


tent 1 or 0 depending on whether an edge joins the vertices or not. Thus 
the figure-counting series assumes the simple form 


A(z) =1+2 


Next, consider the configurations of length 10 corresponding to sequences 
formed from the 10 figures. The appropriate permutation group P 
consists of the set of permutations of the 10 figures (that is, unordered 
pairs of distinet vertices) induced by the group P* of all possible permu- 
tations of the 5 vertices. (Note that there are 5! such permutations.) 

When 1 + z* is substituted for each occurrence of y, in the cycle 
index Z(P) given earlier and the resulting expression is simplified, we 
obtain 


Bia) = 1 + & + 227 + 4x + Gat + 6a? + 6x + 42? + 22° 4+ 2° 4+ 2 


We may conclude, for example, that there are 6 distinct graphs having 
4 edges, because of the presence of the term 6z‘. These are exhibited 
in Fig. 6-12. 

To find the number of distinct simple directed graphs having four 
vertices, we interpret figures as ordered pairs of vertices. The same 
figure-counting series A(z) = 1+ <2 is appropriate, since an ordered 
pair of vertices either is or is not joined by an are. The cycle index 
Z(P) for the group P of permutations of the 12 ordered pairs of vertices, 
induced by all possible permutations of the vertices, was noted earlier. 
By substituting A(2*) for yin Z(P) and simplifying, we obtain 


B(z) = 1+ 2+ 52? + 132? + 2624 + 3825 + 482° + 3827 
+ 262° + 132° + 5a? + 24 + 2? 


From the presence of the term 5x’, for example, we conclude that there 
are five distinct graphs having two ares. These are shown in Fig. 6-13. 

We turn now to the problem of counting the number of distinct 
graphs having four vertices, no loops, and at most two edges joining 
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any pair of vertices. In this case, a figure is once again interpreted as 
an unordered pair of vertices. However, the content may now be 0, 1, 
or 2, depending on the number of edges joining the vertices. Thus the 
figure-counting series is 


A(z) =1+2+2? 
When 
A(@') = 1+a*+ 2% 


is substituted for each occurrence of y, in the appropriate cycle index 
(given earlier), we obtain 


Biz) = 1+ 2 + 3a* + 5a9 + 824 + 925 + 120° + 9x7 + 82° 
+ 5a? + 3a 4+ gi + zh 


Thus there are, for example, eight distinct graphs, of the type being 
considered, having four edges. These are exhibited in Fig. 6-14. 

By modifying the definitions of figures, contents, and P, other graph- 
theoretic enumeration problems can be resolved. See Ref. 44 for a dis- 
cussion of these variations. Further discussion and a proof of Polya’s 
theorem, in a more general form, may be found, for example, in Ref. 24 
of Chap. 1. An interesting discussion of some solved and unsolved enu- 
meration problems may be found in Ref. 48. 


6-6 MINIMUM BRICK FACTORY CATASTROPHES 


In a brick factory there are m ovens in which bricks are baked. 
The bricks are then loaded on a small special-rail car at each oven and 
pushed to any one of n platforms where a loading truck may be avail- 
able. Since each oven must be connected by rail to each loading plat- 
form, the rail lines have a great number of intersections. As the cars 
go over the intersections they are often derailed, causing spillage of 
bricks and a traffic jam within the factory. The problem is to construct 
the rails from the ovens to the destinations with a minimum number of 
crossings and thus minimize the hazards of derailment. 

This problem may be solved within the framework of graph theory, 
where the rail lines correspond to edges of a graph connecting vertices 
(corresponding to the ovens) to other vertices (corresponding to the load- 
ing platforms). One condition is imposed, and that is that no three edges 
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may intersect at the same point unless it is a vertex. Two edges, how- 
ever, may intersect at an intermediate point. For example, in the case 
of four ovens Oi, ... , Oy and four platforms Py, ..., Ps we have 
four intersections marked with z in Fig. 6-15. 

Theorem 6-6 The minimum number of interior intersections of 
edges joining every one of m points to every one of n points in the plane 
(two edges are assumed to intersect at no more than a single point) is 
not less than: 


(r? — r)(s? — 8) ifm = 2r,n = 28 

(r? — r)s* if m = 2r,n = 28+1 
r?(s? — s) ifm = 2r +1, = 2s 
reg? ifm =2r+1,n=2s+1 


Before proving the theorem we shall introduce the notion of a fan 
and prove a lemma. 

Definition A fan at a vertex v consists of all edges incident with o 
minus their end points. Thus » is also excluded from the fan. 

Remark: Note that if we are given two sets of three vertices in the 
plane and we form the three fans at the vertices of one set, each of which 
fans has all three vertices of the other set as end points, we obtain the 
utility graph of Kuratowski, which is nonplanar, and hence the fans must 
have at least one point of intersection which is not one of the vertices. 

Lemma 6-7 Consider a planar graph consisting of three fans at 
the vertices w1, us, us all having the same end points v1, . . . , Ym together 
with the vertices themselves. The number of interior-point intersec- 
tions, assuming that no more than two edges intersect at any point, is 


at least 
pit r—r  ifm=2r 
2 ifm =2r+1 


Proof: The proof is inductive. 

The lemma is true if r= 1 by 

So theabove remark, Let us assume 

Vc, TSA the lemma to be true for r and 


z sh re % show that it is true for r + 1, in 
Ie which case the number of inter- 
sections must be at least 
Ny ror ifm = 2r +2 
+? if m = 2r +3 
P, 


Now consider the subgraphs Cy 
Figure 6-15 (k =1, . . . ,m) consisting of the 


0, 
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vertices », (k = 1, ... ,m), the fans ato, (k = 1, . . . , m) determined 
by the vertices u;, w2, us, and the vertices w1, wz, us. If every pair of these 
subgraphs had a point in common other than the end points of the fans, 
then the number of intersections would be obtained by considering these 
subgraphs two at a time. This would give for the intersection number 


-1) 
G)-82-. 


But if m = 2r + 2, then a = 27? + 8r+1> 72+ 7; and ifm = 2r + 3, 
then a = 2r?+ 5r +3 > (r+ 1)*% In that case the lemma would be 
proved. Suppose, then, that some pair of subgraphs C,, and C;, have 
no point in common except wu, U2, us. Consider the subgraph C’ con- 
sisting of C;, and C,, taken together. By the remark, every other sub- 
graph must have at least one interior intersection point with C’. Since 
there are m — 2 remaining subgraphs, there are at least m — 2 inter- 
sections with C’ and no two intersections are the same, since no three lines 
can intersect at a nonterminal point. We add to this number what we 
obtain by applying the induction hypothesis to the minimum possible 
number of intersections on our graph without », and »,, and the fans 
C,, and C;,. We have 


(2r) + (7? — r) 
(r+ 1) +9? 


Note, for example, that if the number of vertices is 2r, then there are 
at least r? — r intersections, which we add to the 2r intersections with 
C’. This completes the proof of the lemma. 

Proof of the theorem: Again the proof is inductive. The theorem is 
true forr = 1,s = 1 by theremark. We prove that if the theorem is true 
as stated for m and n, then it is also true for the combinations (m,n + 1), 
(m+1, n), and (m+ 1, n+ 1). Suppose that we have m ovens 
O.,..., On and n+ 1 platforms Pi, ..., Paz. The graph of this 
interconnection may be obtained from the graph @ of the problem with 
O,,...,0O,and P;,... , P, by adding the vertex P,4.1 and its con- 
necting edges to O;,..., Om. Now G itself may be considered in 
terms of fans at P;, . . . , Pn, all with the same end points 01, . . . , On. 
Let n be even, that is, m = 2s; then we shall consider the least number of 
intersections of the fan at P,y: with the fans at P: and at P»: taken 
together, with the fans at P; and P, taken together, and so on. We 
have s such pairs, and by the lemma if m = 2r, there are at least r? — r 
intersections with each pair and hence the total would be at least (r? — r)s; 
if m = 2r + 1, the number of intersections would be at least r?s. If n 
is odd, that is, n = 2s + 1, the single remaining fan at P,, can be ignored 
and the same numbers (r? — r)s and r’s are obtained. Now by the 


rtr ifm — 2 = 2r 
(P+1 ifm—2=2r4+1 
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induction hypothesis, G itself has at least (r? — r)(s? — s) intersections 
which we now add to the above result, yielding for the smallest possible 
number of intersections 


(P—rn(e— as) +? —rs=(?—re? ifm =2r,n=28+1 
from which we have 
(= st + (= 1s = GF = N+ 8 - +) 

ifm = 2r,n = 28+2 
Also, we conclude that the least number of intersections is 
r2(s? — 8) + 72s = rg? ifm = 2r+1,n=2s8+1 

from which we have 

rg? + 72s = r*[(s + 1)? — (8 + 1)] ifm = 2r+1,n = 28+2 


thus proving the theorem. 

The construction of the connections with the least number of inter- 
sections may be obtained as follows: Consider rectangular coordinates 
in the plane. If m = 2r, take on the z axis the points with abscissas 


=f =P =D), es eg ey Std eh 
and if m = 2r + 1, take on this axis the points with abscissas 
—7, -(—1),...,—2, —1,1,2,...,7¢+41) 


If n = 2s, take on the y axis the points with ordinates 
—s, —(s—1),...,—-2, —1,1,2,...4,8 
and for n = 2s + 1 take the points with ordinates 
=—3, —(9— 1),... , —2, —1,1,2,... 49%, @-+- dD) 


and then join with a straight line segment all the points from the « axis 
to all the points of the y axis. In this case one can easily count all the 
intersections. 

Remark: One can also prove by induction that the smallest number 
of regions in the plane obtained by construction of the foregoing connec- 
tion graph is 


(r? — r)(s? — s) + 4rs — 2(r + 8) + 2 if m = 27, n = 28 
(r? — r)s? + 4rs + 28 + 1 ifm = 2r,n = 28+ 1 
rs? + 4rs +1 ifm =2r+1,n=28+1 


Exercise 6-8 Construct the connections for the five-oven four-platform problem 
according to the foregoing procedure. 


Exercise 6-9 Repeat Exercise 6-8 for the five-oven and six-platform problem. 
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6-7 THE MINIMUM NUMBER OF INTERSECTIONS IN 
COMPLETE GRAPHS 


Zarankiewicz’s™ result, discussed above, concerns the minimum number 
of edge intersections, when drawn in a plane, of the bipartite graph con- 
sisting of two sets of vertices with one edge joining each element of one 
set to each element of the other set. When each set has three vertices, 
we have one of the two basic nonplanar graphs appearing in Kuratowski’s 
theorem characterizing planar graphs. By generalizing on this graph, 
Zarankiewicz was able to establish his result and also to indicate a scheme 
for its realization. 

One can pursue*:45.79 a similar investigation of the n-vertex general- 
ization of the other basic Kuratowski graph: the (simple) complete graph 
on five vertices. The main results of the investigation® will now be 
described. Let G, denote the complete graph on n vertices. We wish to 
determine J,,, the minimum number of edge intersections when G, is drawn 
in a plane in such a way that at most two edges intersect at any point 
other than a point representing one of the original n vertices. 

We need only consider n > 5, since clearly J, = 0 forn <5. In 
order to obtain an upper bound M/, on J,, consider the following specific 
representation of G, in a plane, called the alternating linear model. Choose 
a horizontal line segment S in the plane, and subdivide S into n — 1 seg- 
ments by introducing points p;,..., P, corresponding (from left to 
right) to the vertices of G,. Join p; to ps, pa, . ~~ , Pn by means of 
semicircles lying above S. Then join pa to ps, ps, - . . , Pn by means 


of semicircles lying below S. In general, join p; to piz2, Pisa, . ~~ y Pn 
fori = 1,2, ...,n — 2, using semicircles lying above (below) S if 7 is 
odd (even). 


The construction for n = 6 and n = 7 is shown in Fig. 6-16. Note 


Figure 6-16 
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that the number of edge intersections is 4 for Gz and 11 for G7, In gen- 
eral, if M, denotes the number of edge intersections in the alternating 
linear model of G,, it can be shown (do as an exercise) that 


n(n — 2)*(n — 4) 
48 
(n — 1)(n — 3)(n? — 4n + 1) 
48 


n even 
M, = 


n odd 


Thus M,, as given above is an upper bound on J,. It is further shown in 
Ref. 79 that a somewhat smaller upper bound M/, on J, is given by the 
following formulas: 

n(n — 2)2(n — 4) 


nm even 
Me =} 9 — yi — 3) 
<a n odd 


Note that for x even, M’ = 4M,. For n odd, M/, agrees with $M, with 
respect to the coefficients of n‘ and n*. 

The foregoing result can also be obtained, for example, in the even 
case by distinguishing r = n/2 pairs and forming all G,’s between the 
pairs. Each such G, must be drawn in such a way that the six edges do 
not intersect one another. Then it can be shown that there are at least 
(3) fé 5B ‘) intersections. On putting r = n/2, one has the result. 

If there exists a minimal-intersection representation of G, which has 
the property that it contains a minimal-intersection representation of 


A 


C= 


Figure 6-17 
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G,-» for each even k < n(k = 2, 4, 6 are sufficient), it can be shown 
inductively that M/, given above, is precisely /,,. 

The values of M‘, and M;, are 3 and 9, respectively. The solid lines 
in Fig. 6-17 constitute a representation of Gs which realizes Mj. By 
adding the dashed lines, we obtain a representation of G; which realizes 
M;. It would be interesting and useful to attempt to find a specific 
straight-line construction for G@, for all n > 5. 


Exercise 6-10 Draw graphs comparable to Fig. 6-17 for n = 8, 9. 


Exercise 6-11 To construct a graph with minimum intersections for the case 
n = 10, use two concentric polygons with an equal number of vertices. Join the 
vertices of the exterior polygon symmetrically on each side in the outer region, those 
of the interior polygon by straight lines in its interior, and those of the two polygons 
symmetrically in the enclosed region. Generalize this method. 


PUZZLES AND GAMES 


6-8 A PROBLEM INVOLVING COLORED CUBES” 


In most problems considered so far the original formulation has either 
been directly in terms of a given graph or else has been such that an 
immediate reformulation in terms of an obvious appropriate graph was 
possible. In some situations, however, the real problem is that of dis- 
covering an appropriate graph, whose structure may bear little superficial 
resemblance to the original problem, The following example is a simple 
case in point. 

Let C; to C; denote four congruent cubes and let Y, R, B, andG denote 
the colors yellow, red, blue, and green, respectively. Suppose that each 
face of each cube is colored with one of these colors in such a manner that 
every color is represented on every cube. (Aside from this constraint, 
the colors are assigned independently to each cube’s faces.) Consider the 
following problem: For a given assignment of colors find (if possible) a 
way to stack the cubes in a single pile (forming a prism with square base) 
so that the four squares on each lateral side of the prism have distinct 
colors. 

Remark: Without further constraints, the problem has no solution in 
some cases. For example, suppose that on each cube all three faces 
meeting at a corner are red. Then no matter how the cubes are oriented 
and stacked, the lateral sides of the prism will include eight red squares. 
A solution requires precisely four. 

For any coloring of the cubes, consider the following associated graph 
having 4 vertices and 12 edges. The vertices correspond to the colors Y, 
R, B, and G@. For each cube C; there are three edges labeled 7. These 
correspond to the three pairs of opposite faces-and join the appropriate 
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Figure 6-18 


vertices (colors). Figure 6-18 illustrates a valid coloring. (Note that a 
loop results if opposite faces have the same color.) In general, such a 
graph represents a permissible coloring if every vertex is incident with at 
least one edge having each of the labels 1, 2, 3, and 4. 

Now suppose that a solution to the problem exists. Consider two 
opposite lateral sides of the resulting prism. The eight corresponding 
squares represent one pair of opposite sides of each cube, and each color 
occurs twice. In terms of the graph, there exists a subgraph having four 
edges, each labeled differently, and such that each vertex is of degree 2. 
The other pair of opposite lateral sides of the prism determine a second sub- 
graph which has the same properties and is edge-disjoint from the first. 

Conversely, it can be shown that if the associated graph has two 
edge-disjoint subgraphs with these properties, then there exists a cor- 
responding solution to the problem. Figure 6-19 shows two appropriate 
subgraphs of the graph of Fig. 6-18. 


Exercise 6-12 Establish a frame of reference by looking at the prism from a fixed 
side, referring to the front, right side, and top of each square as z, y/, and z, respectively, 
and referring to the corresponding opposite sides as x’, y’, and 2’. Use the subgraphs 
in Fig. 6-19 to determine the appropriate color of each side of each cube to solve the 
problem. 


Remark: Note that there are 41,472 possible arrangements of the 
cubes, The cube on the bottom has three possible arrangements (three 
essentially different ways of putting it on the table). The remaining 
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(a) (b) 
Figure 6-19 


cubes each have 24 possible orientations: 6 to choose the side on which it 
rests and then 4 possible rotations. 


6-9 SYSTEM TRANSITION PROBLEMS 


Many problems, when formulated abstractly, are of the following general 
type: A “system” of some sort that at any time can be in only one of a 
finite number of “‘states’”’ is given. The set of possible direct (i.e., one- 
step) transitions is known—either by explicit enumeration or by a suitable 
general rule. Given a specified initial state, the problem is to determine 
whether or not one can transform the system into a desired final state by 
means of an appropriate sequence of direct transitions. (If a cost is 
assigned to each direct transition, it may further be required to attain the 
desired state with minimal cost.) 

If the states and direct transitions are represented respectively by 
the vertices and arcs of an appropriate simple directed graph, the problem 
is one of finding a path joining a specified pair of vertices (states). In 
many cases, the critical step in the analysis of such problems is the sys- 
tem definition and, more specifically, the selection of a collection of states 
which adequately characterize the possibilities and in terms of which the 
direct transitions can be conveniently enumerated. 

While various applications of a more serious nature will occur to the 
reader, for illustrative purposes we shall here consider the puzzle of the 
missionaries and the cannibals.** Three missionaries and three cannibals 
arrive at bank A ofa river and must cross to bank B by using a rowboat 
which will accommodate only two people. All of the missionaries and one 
of the cannibals can row. Is it possible to achieve this transfer through 
a sequence of crossings such that the cannibals never outnumber the 
missionaries on either bank unless, of course, the number of missionaries 
is zero? (The missionaries feel rather strongly about this ground rule.) 

In attacking this problem, we shall take as the system the collection 
of cannibals and missionaries on bank A. Letting M, C, and K denote a 
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missionary, a nonrowing cannibal, and the rowing cannibal, respectively, 
the system can take on 24 possible states (since M, C, and K can assume 
4, 3, and 2 distinct values, respectively). Of these, the following 16 are 
feasible: 

MMMCCK MMMK CCK CK 

MMMCC MMCC MC C 

MMMCK MMCK MK K 

MMMC MMM oe: 0) 


Here O means that no one is on bank A. (The reader may verify that 
the other eight states violate the ground rule on one bank or the other.) 

Transitions to this system occur through the departure or return of 
the boat. The graph of Fig. 6-20 enumerates all (25) possible transitions. 
For simplicity, transitions are shown as edges, since either direction is 
possible, depending on whether the departure or arrival of the boat is the 
cause. However, each edge should be regarded as representing two 
oppositely oriented ares corresponding to departure (moving clockwise 
around the circle) and arrival (counterclockwise). The problem, restated, 
then becomes: Find (if possible) a path from MM MCCK to O whose ares 
correspond alternately to departures and arrivals. 

Without the last stipulation, the problem is easy. (For example, 
the sequence of states MM MCCK, MMCK, CK, O would do.) With it, 
the problem becomes considerably more difficult. Before proceeding, 
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the reader may wish to spend a few moments with Fig. 6-20, either to 
find a solution or to become convinced that there is none. 

In order to seek a solution systematically, we shall now introduce an 
auxiliary graph having the same vertices. The objective is to reflect 
round trips which start from, and return to, bank A, rather than one-way 
trips. We shall thereby eliminate the need for considering alternation of 
crossings by building it into the structure of the graph. 

Suppose, for example, that the boat is at bank A and the system is in 
state MMMC. From Fig. 6-20 we see that we can go, via state MC, to 
state MMCC or MMCK, so we construct round-trip edges joining MM MC 
with MMCC and with MMCK. Figure 6-21 shows the totality of all 
such edges (including one-way trips terminating at 0). In terms of the 
graph of Fig. 6-21, the problem becomes: find a chain from MMMCCK 
to 0. We can now readily see that such a chain exists. For example, 


MMMCCK, MMMCK, MMMK, MMCK, MMCC, CCK, MC, 0 


is such a chain, Adding (parenthetically) the intermediate states, the 
corresponding solution relative to the original graph of Fig. 6-20 is 


MMMCCK, (MMCK), MMMCK, (MMM), MMMK, (MK), 
MMCK, (MC), MMCC, (CC), CCK, (C), MC, O 


which is shown in Fig. 6-22. Note that the path is not simple, since two 
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Figure 6-21 
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ares enter MMCK and MC. (In each case, one of the ares corresponds 
to a boat departure from bank A and the other to a boat arrival.) 


Exercise 6-13 Determine whether the problem of the missionaries and the 
cannibals can be solved in fewer crossings under the less stringent assumption that all 
cannibals can row. (Note that the more stringent assumption that only one mis- 
sionary can row makes the problem more interesting.) 


Exercise 6-14 Given the directed graph shown in Fig. 6-23, find a path from » to 
w whose ares alternate between solid and dashed ares, the first being solid. 

(a) Solve the problem by inspection. 

(b) Solve the problem by applying the method of the text, by constructing an 
appropriate auxiliary graph each of whose arcs corresponds to a pair of alternating 
ares (or a single solid are terminating at w) in the original graph. 

(c) Note that the solution is not a simple path in the original graph. (In fact, 
the path returns to v.) 


Tn some cases, individual feasible transitions are obvious, but it may 


Se ees 


Figure 6-23 
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Figure 6-24 


not be clear in a particular problem whether one can get from a given 
initial state to a desired final state. An example of this is the problem of 
finding one’s way through a complex maze or labyrinth, which occurs 
very frequently in the literature of mathematical recreations. It is, in 
essence, the problem of finding an edge progression joining specified 
vertices of an appropriate graph which characterizes the structure of the 
maze. 

Consider, for example, the two-dimensional maze of Fig. 6-24. This 
maze consists of 36 compartments some of which are joined by ‘‘door- 
ways’ (indicated by breaks in the lines). Suppose that the problem is to 
reach point Q, outside the maze, starting from compartment P. Consider 
the graph whose vertices correspond to the compartments and whose 
edges indicate those pairs of adjoining compartments which are connected 
by a passageway. Tigure 6-24 shows the associated graph. In terms of 
this graph, the problem is to determine an edge progression joining P 
and Q. In principle, the problem as given is very simple. Having con- 
structed the associated graph, we can apply the labeling methods of Chap. 
3 to find a tree joining P with all other vertices in the same component. 
In particular, we shall have found a chain joining P and Q (if the problem 
has a solution). 

We have been overlooking one serious difficulty, however. A 
straightforward labeling technique requires systematic scanning of the 
edges and vertices and assumes that we actually know the structure of 
the entire maze. In practice, if we are in the maze, say, at point P, we 
have at the outset only very local information, namely, we know only 
which compartments are directly reachable from P. Additional informa- 
tion is learned only gradually, by probing in various directions. 
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All the numerous existing techniques for finding one’s way out of such 
amaze involve systematization of this process of probing with the objective 
of avoiding unnecessary repetition of the same route. (Some repetition 
is unavoidable. The existence of a blind alley cannot be anticipated; it 
can only be discovered. When a blind alley is found, retracing part of 
one’s route is necessary.) However, by marking the vertices and edges as 
they are encountered and probed, it is possible to formulate a procedure 
whereby no edge is traversed twice in the same direction, regardless of the 
structure of the maze. (It should be noted that the maze may be three- 
dimensional, in which case the associated graph may not be planar.) 


6-10 MATRIX FORMULATION AND SOLUTION OF THE 
CROSSING PROBLEM” 


We now illustrate an application of the vertex matrix to the solution 
of the missionary-cannibal river-crossing problem given in the preceding 
section. The conditions are that the boat carries no more than two people 
at a time and that at neither side should there be more cannibals than 
missionaries who out of habit would eat their outnumbered saintly men- 
tors. Let us consider the simple case of ferrying across the river a group 
of two missionaries and two cannibals. 

Prior to writing down a vertex matrix, we choose vertices correspond- 
ing to states at one side. Let us assume that the group starts on the left 
bank of the river. We now consider all possible states (within our two 
rules) on the left bank. A state is indicated by a pair of numbers the 
first of which indicates the number of missionaries and the second the 
number of cannibals. We have the following possibilities on the left bank: 


v; = (2,2) v4 = (1,1) vs = (0,1) 
¥_ = (2,1) v5 = (0,2) v7 = (0,0) 
vs = (2,0) 


Note that (1,0) is not allowed, since the corresponding state on the right 
bank would be (1,2) and the single missionary would be cannibalized. 
Similarly, the state (1,2) is not permitted on the left bank. We now form 
a vertex matrix whose elements are 0 or 1 depending on whether transi- 
tions from one state on the left bank to another also on the left bank are 
possible. The transitions, of course, are effected by the departure of the 
boat. Thus we list the vertices on the left and on top and enter a 0 or 1 
depending on whether it is possible to go from the state represented by 
the vertex on the left of the matrix to another state represented by a 
vertex on top. 
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We have the vertex matrix 


V1 V2 Us Ve V5 Ye UV: 
(Ow 1 F, 220.0) 
ror 0’ 1 1) 0) 1 -@ 
w|0 0 6 0, 0 O 1 

V=agel0r 0 or OO. 1 of 
He 0" 10 0 -0F 08 Fb «2 
pa/O) 6 0 0 0 0 FD 
7 (0 0 0 0 0 0 0 


On the right bank of the river one would have an identical set of states 
which are essentially the states complementary to those on the left bank. 
Their matrix would be V’, the transpose of the above matrix. The reader 
will readily verify without difficulty that to obtain the matrix of transi- 
tions after one round trip, one must take the product VV’. In general, 
for the transition matrix of m round trips we have (VV’)™, and since the 
object is to move the group to the right bank, we must multiply this by 
V. This gives (VV’)"V, and the problem is to find the number of round 
trips m such that the element in the 1, v7 position of this product is 1, that 
is, we have the transition (2,2) — (0,0) on the left bank. Thus the group 
has moved to the right bank. Note that in the successive multiplica- 
tions VV’, VV’V, VV'VV’", ete., elements whose value is more than 1 will 
appear, indicating the number of ways in which the corresponding transi- 
tion can be made. Since the object is to determine at each stage one 
possible transition, all nonzero elements are replaced by unity in these 
products. It turns out that our problem can be solved with m = 2 round 
trips and one last trip forward [i.e., a unit element appears for the first 
time in the 1, v7 position of (VV’)2V]. The computations are as follows: 


0 0 0 0) 0 


CoOoOewrF ORF CCOCOCOrFF}F 


ve 


= 
I 
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WV = (vv)? 


coorrorrer HoocoSoCS 
CORHFORH HH OCOCCOS 
Orrrrooooceoooce 
COP RRR RH CORR OHH 


ococoocooocr. COFr KKK OO 
eococoorr Oe OorRKH OO 
OreeeFPeEeH Hooococo 
CORPeP RR OCO ORRRH OS 
Cee e eee ORR RHEE 
ee ee 
CORR RH OO ORR HEH 
cooooooco sooocse 


ocooocorre 
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The problem now is to read off the solution from the matrices. We 
look at the last matrix, that is, (VV’)?V in the (v1,v7) position where there 
is a unit element, and we ask which possible nonzero element of the first 
row of (VV’)? could have given rise to this unit element in (VV’)*V on 
multiplication by the seventh column of V. One possibility is the element 
in the (1,4) position of (VV’)? (first row, fourth column), since there is also 
a unit element in the (4,7) position of V. Another possibility could be the 
element in the (5,7) position of V; but we choose the first. Thus the last 
transition is ys v7. We next ask where the unit element in the (1,4) 
position of (VV’)* could have come from. By examining the first row of 
(VV‘)V and the fourth column of V’, we find that it came from the fact 
that there is a nonzero element in the position (6,4) of V’ [since the cor- 
responding element from (VV’)V is also nonzero]. Thus the next-to-last 
transition is vs—> vs. Again we ask for the origin of the unit element in 
the position (6,1) of (VV’)V, and we find that it comes from the fact that 
the (2,1) element of VV’ and the (2,6) element of V are unity. Thus the 
third transition from the end is vz—> v5. We similarly find that the 
fourth and fifth transitions from the end are respectively v3 — v2 and 
v,;— v3. Thus the transitions are given by 


V1 — U3 U3 — V2 Ve — V6 v6 4 U4 07 


or simply 24, 03, 2, 6, V4, and v7. 

We could indicate alternative transitions if we wished. To interpret 
the solution in words, we note that, since vs is (2,0), both cannibals must 
first cross and one must return (because of v2), the two missionaries must 
cross in order to have vs on the left bank, one missionary must return so 
that the next state on the left bank will be v4, and finally both the mis- 
sionary and the cannibal must cross, giving the final state v7. 


Exercise 6-15 The states of the problem of three missionaries and three canni- 
bals where all missionaries and only one cannibal can row can be represented by 
(m,r,c), where 0 < m < 3,0 <r <1,0 <c < 2, mrefers to the missionaries, r refers 
to the rowing cannibal, and c refers to the remaining two cannibals. There are 16 
possible states for this problem. Write down its transition matrix. 


To determine whether a solution exists for a given problem without 
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going through all the manipulation needed to find the solution, the follow- 
ing method can be applied: 

The solution must take the form (VV’)"V. Suppose the problem 
begins at vertex v; and the ultimate goal is vertex v,. Then if a solution 
exists, there is an m such that (VV’)"V is nonzero in the (1,k) element. 
As shown above, to yield a 1 in the (1,4) element of (VV’)"V, there must 
be at least one nonzero element in row 1 of (VV’)™ which corresponds to a 
nonzero element in column k of V. So the problem reduces to determin- 
ing which of the elements of row 1 of (VV’)™ ever becomes nonzero for any 
power m. This can be tested easily even for very large matrices. Sup- 
pose the set {v.,%, . . . Ve} consists of the vertices corresponding to 1’s in 
the first row of VV’. These are the vertices which can be reached in one 
round trip from vertex v;. Add to this set all the vertices which have 1’s 
in row a, row b, ... , rowc of VV’. This new set consists of the ver- 
tices which can be reached from 1 in two round trips. Repeat this addi- 
tion for any new vertices in the set until it has been done for all vertices of 
the set. This final set consists of all the vertices which can ultimately be 
reached from vertex 7; in any number of round trips. Now, if this set 
contains any vertices which have nonzero elements in column k of V, the 
problem has a solution; otherwise, it has not. 

For an example, consider the four-missionary and four-cannibal prob- 
lem, where vertex = (number of missionaries, number of cannibals) : 


1 = 44) vs = (4,0) v8 = (1,1) on = (0,2) 
vo = (4,8) 9 = (3,3) 0 = (0,4) ve = (0,1) 
vs = (4,2) v7 = (2,2) 10 = (0,3) 18 = (0,0) 
v4 = (4,1) 


oi 


od 


The problem is to get from v, to 143. 
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In column 13, vs, v1, and v2 are nonzero. Therefore, if this problem 
has a solution, (VV’)" must have a nonzero element in the first row at 
Vg, Vix, OF Vio. 


= 
a 
= 
o 


Vir Vie 


18 
vy 
v2 
U3 
v4 
U5 
U6 

VW’ = 0; 
Us 
Ug 
Yr0 
Vn 
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oeooococococeoceos; 
oeoooooorocorces: 


0 
0 
0 
0 
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ooooocoorcerrros 
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The set of vertices which can be reached from 2; consists of {v1,v2}. v2 
can reach 7, v2, and v3; so add v; to the set, giving {v1,v2,v3}. vs can 
reach vo, 03, Vs, and v6; so add v4 and v6, giving {v1,v2,03,¥4,6}. v4 reaches 
vs and v4, which adds nothing new to the set. vs reaches v3 and vs, which 
also adds nothing new to the set. We have now exhausted the possibil- 
ities. Therefore, 7; can reach only 2, vo, v3, v4, and vs. It cannot reach 
Up, V7, Vs, V9, Vio, Via, V1, OF Vis in any number of round trips. But in column 
13 of V only vs, vi1, and vy2 are nonzero. Since none of these are included 
in the set, the problem is unsolvable. 

Remark: A transition (boolean) matrix may also be used to represent 
a directed graph associated with postulates introduced on a set of state- 
ments Z,, H,,..., E,. The postulates give relations of the form 
E;— E;. Note that we always have 2;— E;. By taking the second 
power of this matrix, we obtain a matrix in which each unit entry indicates 
either a postulate or a proposition which can be proved in two steps, for 
example, H,— E;, E;— Ey, jointly imply Hy— Ey. Unit elements in 
the third power of the matrix indicate postulates or propositions which 
can be proved in two or in three steps. Finally, the (n — 1)st power 
displays all propositions. An interesting problem is to find the smallest 
number of postulates from which a given set of propositions can be deduced. 
Note that there are at least 2" sets of equivalent postulates which yield 
n theorems,?%78 


Exercise 6-16 Show that the total number of n X n matrices whose e!ements 
are 0 or 1 and all of whose diagonal elements are unity is 2"*-. 
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6-11 TRIANGLE DIVISION 
PROBLEM 


Suppose that one divides a triangle 
ABC into smaller triangles by draw- 
ing n lines parallel to the sides (see 
Fig. 6-25, where n = 2) and then 
assigns letters A, B, and C to the 
resulting points as follows: Points 
on BC are labeled B or C but not 
A, and similarly for CA and AB. 
The points interior to the large tri- 
angle can be labeled by any of the Figure 6-25 

three letters. It is desired to show 

that the number of small triangles with three differently labeled vertices 
is odd, 

The proof begins by assigning the number 0 to a line segment whose 
end points are alike and the number 1 in the opposite case. The sides of 
a triangle whose vertices are labeled ABC sum to 3. This sum is 0 or 2 
in the opposite case. The sum along the segments on each side of the 
large triangle must be odd, as can be easily verified. The sum over all 
small triangles must be odd because the interior segments are counted 
twice. Hence, there is an odd number of triangles numbered 3. 


A B A B 


6-12 TWO-PERSON GAMES 


A directed graph is a natural mathematical model to employ when analyz- 
ing some types of competitive situations between two individuals or 
groups of individuals having conflicting objectives. The following brief 
discuss‘on does not attempt to present the most general context in which 
graph theory is applicable to such ‘‘games,” or to precisely identify con- 
cepts with those from the formal theory of games. 

Consider a situation in which two persons alternately make incre- 
mental modifications to a structure (such as a chessboard and its pieces). 
Assume that there is a standard starting “state’”’ (e.g., the positioning of 
the chessmen at the outset) and a “rule book” which completely specifies 
the list of permissible moves, i.e., incremental changes of state, for each 
player. If there are only a finite number of distinct game states, the rules 
of the game can be completely characterized by a finite directed graph 
with two categories of ares, Each state is considered as a vertex v;, and 
there exists an are of category 1 (or 2) from »; to »; if and only if the game 
can be transformed from state i to state j by a valid move by player 1 
(or 2). A complete play of the game consists of an are progression of 
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Figure 6-26 


arcs of alternating types starting at the appropriate initial state with an 
are of the type associated with the starting player. 

Suppose that the game is such that no state is ever repeated, i.e., the 
associated graph has no cycles. Then the number of individual moves in 
a complete play of the game is bounded. Assume, moreover, that to 
‘“vin” means to reach one of a specified set of states or vertices for the 
first time. For example, in the acyclic graph of Fig. 6-26 assume that y 
and z are the winning states and that z is the starting state. (In this 
example the set of permissible moves is assumed to be the same for both 
players, so that it is not necessary to specify two categories of arcs.) 

Remark: One must distinguish between a play of the game and the 
game itself. Thus, for example, a player, in more general games, can be 
in a “winning state” and lose the game, or for that matter, be in a “losing 
state” and subsequently win. The particular kind of game illustrated 
here can be called “two-person perfect-information perfect-recall finite- 
win-lose alternation game in extensive form” (to give a vague idea of how 
special is its category). This is perhaps the simplest kind of game involv- 
ing more than one player. 

Note that the graph has a set S of distinguished vertices (marked 
with asterisks in Fig. 6-26) having the following three properties: 


1. No two vertices of S are joined by an are. 
2. Every vertex not in S is joined to at least one vertex in 8S by an are. 
3. All winning states are in S. 


Properties 1 and 2 are sometimes referred to as internal and external 
stability, respectively. A set of vertices, such as S, which possesses both 
properties is called a nucleus or kernel. Now suppose that player 1 plays 
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first, that a nucleus S is known to player 1, and that the starting state is 
notin S. Then by property 2 player 1 can move toa state in S. If this 
is not a winning state, according to property 1, no matter what move 
player 2 makes it will lead to a state not in S (and hence by property 3 not 
a winning state). Another judicious move on the part of player 1 will 
return the game to S. By this procedure, play will ultimately terminate 
with player 1 the winner. 

Thus, in principle at least, if the structure of the game is completely 
itemized and a nucleus S exists and is located, player 1 has a winning 
strategy if the initial state isnot in S. (Ifit isin S, player 2 has a winning 
strategy.) In practice, of course, the structure of nontrivial games is too 
complex to actually itemize and consider the entire graph. However, if 
the game rules are such that the graph—though very large—has a highly 
systematic structure, it is sometimes possible to devise techniques which 
in effect generate elements of S as required in the vicinity of the current 
game state. 

Remark: For the reader who is familiar with the game of Nim and of 
the strategies for winning, based on binary manipulations, these manipula- 
tions actually amount to determining whether the current state, i.e., the 
remaining numbers of sticks in the various piles, is in S and finding an 
appropriate transition into S if it is not. 


A Switching Game 

The solution of the following game, formulated by C. E. Shannon, is 
due to Lehman.®* Two individuals, a ‘‘cut” player and a “short” player, 
playagameonagraph. They single out two vertices called the terminals, 
and the cut player will make a move by deleting an edge of the graph 
while the short player in his turn will make a move by designating an edge 
which cannot be deleted. Thus alternately the cut player deletes an 
edge and the short player states an edge which cannot be subsequently 
deleted. The object for the short player is to maintain a chain between 
the terminals while the cut player attempts to remove all chains. A game 
in which the cut player can win on going first or second in his deletions is 
known as a cut game, whereas a game in which the short player can win 
by going first or second is known as a short game. A game is neutral if 
whichever player goes first can win. Here we only give the condition for 
a short game: 

Theorem 6-8 A game is a short game if and only if the graph con- 
tains two edge-disjoint trees whose vertices are the same and include the 
two terminals. 

The condition is sufficient. If the short player can win by playing 
second, he can obviously win by going first. His move must always 
consist of preserving an edge which connects the two components of the 
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trees produced by the deletion of an edge by the cut player. Since both 
trees have the same number of edges, he can always connect the resulting 
components by using the other tree. Necessity is harder to prove and will 
not be shown here. 

A version of the switching game, known as “Gale” and, in commercial 
form, as Bridg-it, is described in Ref. 27 and is also treated in Ref. 58. 
The graph involved is essentially the graph depicted by the bold ares in 
Fig. 6-27. In order to make the description of play more symmetric, the 
cut player makes moves on the light graph shown in the figure, while the 
short player makes moves on the bold graph. The former seeks to con- 
struct a chain joining w’ and w’’, while the latter seeks to construct a 
chain joining v’ and v’’. One “move” by cither player consists of desig- 
nating one of his edges which was not previously designated (initially, no 
edge is designated) and which does not cross an edge already designated 
by his opponent. Except for a minor technicality, the bold and light 
graphs are duals of each other. (What is the technicality?) 

For this particular switching game, O. Gross devised a simple winning 
strategy, described in Ref. 27. The game is a neutral game, and the 
strategy is a “pairing strategy,” in which the selection of an edge by the 


ee 


ye 


v 


Figure 6-27 
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second player is always countered by the selection of a specific correspond- 
ing edge by the first player (except if when the corresponding edge has 
already been played). 


6-13 APPLICATION TO BOARD GAMES™+:" 


Suppose we are given a set of squares of a chess board, as illustrated in 
Fig. 6-28, where from an even-numbered square it is possible to move one 
square vertically or horizontally and from an odd-numbered square it is 
possible to move one square diagonally. By associating a vertex with 
each square, one can form the vertex matrix V of the associated graph. 


Exercise 6-17 Determine by examining the element in the seventh row and third 
column of V‘ the number of ways one can move from 7 to 3 in four moves; then obtain 
the (7,3) element from V + V? + + - - + V* to accomplish the same task in less than 
six moves, and finally from the d'agonal elements of V* obtain the number of ways a 
piece can return to its starting position in five moves. 


The reader may be familiar with various puzzles having the general 
objective of precisely covering a given plane “board” with a set of plane 
“pieces” whose total area equals that of the board to be covered. In 
the case where each piece has only a finite set of permissible board posi- 
tions, this problem can be phrased in graph-theoretic terms and solved 
(at least in principle, and in some cases in fact) in such terms. The 
following brief discussion outlines this approach to such problems and is 
based on work by P. Jullien.5* 

Assume, for the sake of concreteness, that the board, as well as each 
of the pieces, consists of a rectangle whose dimensions are integers. 
(Much more general assumptions could be made.) Suppose, moreover, 
that the board has been subdivided into unit squares and that the only 
permissible positions for any given pieces are those in which no squares 
are partially covered. 


With this understanding, we 
can itemize all the permissible 
board positions of each piece. 
Having done so, two board posi- 
tions will be considered compatible 
if and only if they correspond to 
two different pieces and do not 


have any overlapping area. 
Let & denote the number of 
pieces, and let n; represent the num- 
ber of permissible positions of the 
ith piece. Consider the graph G Figure 6-28 
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k 
having 2 n,; vertices, corresponding to all possible positions of all pieces, 
1 


and such that vertices v and w are adjacent if and only if the correspond- 
ing positions are compatible in the above sense. It is readily seen that 
every solution to the puzzle corresponds to a complete subgraph having k 
vertices, and conversely. Thus the task of finding all solutions to the 
puzzle is equivalent to that of finding all complete subgraphs having 
k vertices. 

For the remainder of the discussion, let A, B, . . . , Z denote the 
pieces, with A; denoting a particular position (i.e., vertex) of piece A, 
Jullien describes a succession of transformations which start with the 
graph of the puzzle and which end with a graph whose vertices are the 
various solutions, i.e., the complete subgraphs of the original graph. 

As the first step, all vertices corresponding to pieces A and B (where 
these may be any two of the pieces) are removed, being replaced by 
vertices of form A B,;, restricted to those ij combinations such that A; and 
B; are compatible. In addition, AB;; is joined with positions of pieces 
C, D, . .., 4 whenever those positions are compatible with both A; 
and B;. It may be easily seen that there is a 1-1 correspondence between 
complete graphs with k vertices in the original graph and complete sub- 
graphs with k — 1 vertices in the new graph. 

By repetition, one finally arrives at a graph whose vertices are of 
the form ABC... Z; .. . q, signifying that A;, B, ... , Z, deter- 
mine a complete subgraph of the original graph of the puzzle. (Can you 
fill a 12 X 12 X 12 cube with 2 X 4 X 8 bricks? Prove.) 


MATCHINGS 


6-14 MAXIMAL MATCHINGS AND RELATED TOPICS 


So far, matching is a theoretical subject. It is an excellent example of a 
mathematical theory that needs an application. Generalizations of the 
basic problem considered here are exemplified by the section dealing with 
the interconnection of electric power stations. 

Let G = (V,2) be a graph having no loops. A set of edges MCH is 
called a matching in G if no two edges of M are adjacent. Thus each 
vertex of G is incident with at most one edge of M. A vertex is said to be 
either covered or exposed, relative to M, depending on whether or not it 
meets an edge of M. The empty set is a legitimate (though uninterest- 
ing) matching relative to which every vertex is exposed. 

A matching in @ will be called a maximal matching in G if no matching 
has higher cardinality. If every vertex is covered, the matching is said 
to be perfect, and it is sometimes called a 1 factor. A perfect matching, 
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if one exists for G, is clearly maximal. (Note that no perfect matching 
can exist if |V| is odd.) 

Tutte® has characterized the set of graphs having perfect matchings. 
The main objective of this section is to describe a recent algorithm due to 
Edmonds” for finding a maximal matching in an arbitrary graph. In 
particular, it will determine a perfect matching if one exists. In contrast 
to other known approaches to solving this problem for general graphs, the 
maximum effort required to implement this algorithm increases alge- 
braically, rather than exponentially, with the number of vertices of G. 

Suppose that @ is bipartite, having a vertex partition {V:,V2} such 
that every edge joins a vertex in V; to a vertex in Vz. Matching prob- 
lems often arise in the context of bipartite graphs, particularly when the 
vertices in V; and V» represent different kinds of entities (e.g., men vs. 
women, men ys. jobs, jobs vs. machines). In such settings one frequently 
wishes to “‘pair’’ or “match” the two types of entities in such a way that 
there are as few as possible residual unmatched entities (i.e., exposed 
vertices). The structure of the original graph serves to itemize all poten- 
tial or feasible pairings. 

Given a matching M in a graph G = (V,£), a simple chain C in G is 
called an alternating chain, relative to M, if (when the chain is traversed 
from one end to the other) its edges are alternately matching edges (edges 
of M) and nonmatching edges (edges of H — M). Given a matching 
and an alternating chain C, consider the edge set M’ = M © C consisting 
of those edges appearing in M or in C but not in both. Expressed differ- 
ently, M/’ is obtained by deleting from M the matching edges in C and 
adding to M the nonmatching edges in. Since the number of matching 
and nonmatching edges in C differs by at most 1, the number of edges in 
M and M’ also differs by at most 1. 

To illustrate the preceding remarks, consider the matching M indi- 
cated by the bold ares in Fig. 6-294. The set M’ = M ® C shown in 
(c) is obtained by taking as C the alternating chain shown in (6). If, as 
in the preceding example, the end points of alternating chain C are exposed, 
relative to M, then it is readily seen that J’ is necessarily a matching and 


(a) (b) (c) 
Figure 6-29 
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that |M’| = |M|+ 1. For this reason, an alternating chain both of 
whose end points are exposed is called an augmenting chain. ‘The exist- 
ence of an augmenting chain is a necessary, as well as sufficient, condition 
for M to benonmaximal. Stated formally, we have the following theorem 
due to Berge,* with proof along the lines of Hdmonds:*° 

Theorem 6-9 A matching M in a graph G is maximal if and only 
if G contains no augmenting chain relative to M/. 

Proof: To complete the proof, it remains to show that if M is not 
maximal, then there exists an alternating chain joining two exposed ver- 
tices relative to M. (The converse has already been established.) 

Assume that matching M’ has one more edge than // has, and consider 
the graph G’ = (V’, M ® M’), which includes precisely those edges which 
occur in M or in M’ but not in both. No vertex v of G’ has degree greater 
than 2, since at most one edge in M, and one in IM’, is incident with v. 
The components of @’, other than isolated vertices, correspond to alter- 
nating chains and alternating simple circuits relative to M (and to M’). 
Each of the latter has the same number of edges from M/ as it has from M’. 
Hence there must be at least one component which corresponds to an 
alternating chain and which has one more edge from M’ than from M. 
The end points of such a chain are necessarily exposed vertices relative to 
M. This completes the proof. 

In view of the preceding theorem, the problem of finding a maximal 
matching can be solved if one has an acceptable method for finding an 
augmenting chain whenever one exists. or we could simply start with 
the “empty” matching as Mo, find an alternating chain C, (a single edge, 
in this case), define M, = My @ Ci, and in general define M; = Mi. ® Ci, 
where C; is an alternating chain joining two exposed vertices relative to 
M;. Eventually, a stage 7 for which no augmenting chain C;1 exists 
will be reached, M; is then maximal. 

Effective algorithms for finding maximal matchings in bipartite 
graphs have been known for some time; see Refs, 22 and 49 for a descrip- 
tion of several of them. The algorithm to be described, which is applicable 
to any graph, systematically searches for augmenting chains and sub- 
sequent enlargements of the matching. In the process it identifies cer- 
tain regions of the graph (associated with specialized trees called hun- 
garian trees) which can be eliminated from further searching without 
danger of overlooking any additional augmenting chains. From time to 
time it may also shrink (or contract) certain subgraphs (associated with 
appropriate circuits of odd length) to a single “pseudovertex,” thus 
simplifying subsequent search. 

Before describing the algorithm, several auxiliary concepts”? will be 
introduced. An alternating tree is a tree in which the vertices have been 
partitioned into two classes, called inner and outer vertices, in such a way 
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(a) 
Figure 6-30 


that every inner vertex is of degree 2 and every edge joins an inner vertex 
with an outer vertex. (Note that all “end vertices,” i.e., vertices of degree 
1, are necessarily outer.) Figure 6-30a is an example of an alternating 
tree (the bold vertices being the outer ones). An alternating tree neces- 
sarily has precisely one more outer vertex than it has inner vertices. To 
show this, note that every edge of the tree is incident with precisely one 
inner vertex. Hence if there are m edges, there are m/2 inner vertices and 
(m + 1) — m/2 = m/2 + 1 outer vertices. 

In order to determine a maximal matching in an alternating tree, 
select any outer vertex as vp and define d; to be the number of edges in 
the tree chain joining vp with any other vertex v;, with do = 0. (This 
is illustrated in Fig. 6-30b.) Every inner vertex v; is adjacent with pre- 
cisely one outer vertex v; such that d; = d; + 1 (that is, v; lies “between” 
vp and »;), The set of edges joining all such pairs »; and v; is a matching. 
It is necessarily maximal, since only vp is exposed. Corresponding to 
every outer vertex there is a maximal matching leaving only that vertex 
exposed, and every maximal matching of the alternating tree is of this 
type. 


Exercise 6-18 Prove that the set of edges defined in the preceding paragraph is 
necessarily a matching. 


Exercise 6-19 Prove that the vertex left exposed by a maximal matching in an 
alternating tree could not be an inner vertex. 


Exercise 6-20 Prove that a maximal matching in an alternating tree has the 
property that the tree chain joining the exposed vertex with any other vertex is an 
alternating chain. (Other vertices do not have this property in general.) 


Given a matching M in a graph G, a planted tree in G is an alternating 
tree T in G such that all matching edges which are incident with vertices 
of T are edges of the tree and such that these edges constitute a maximal 
matching for 7. The (unique) exposed vertex of T' is called the root 
of planted tree 7’. 
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Let T be a planted tree in G, 
relative to matching M, and let vo 
be its root. If there is an edge of G 
joining an outer vertex of 7 with 
an exposed vertex of G other than 

Blossom vo, then 7’ is calied augmenting rela- 
Figure 6-31 tive to e. Clearly, an alternating 
chain joining two exposed vertices 
exists, so M can be enlarged in the manner described earlier. 

A flowered tree is a planted tree 7 together with an edge of G joining 
two outer vertices of 7. If C; and Cy denote the alternating chains 
joining these vertices with vp, then C; U C2 is called a flower, C1: C2 
is called a stem, and C, ® C: is called a blossom. Figure 6-31 illustrates 
these concepts. (If the root of the tree is incident with the blossom, 
there are no edges in the stem.) The vertex where the stem meets the 
blossom is called the tip of the stem. 

Let T be a planted tree in G relative to matching M, and suppose 
that some edge e joins an outer vertex v of 7’ to a covered vertex w not in 
T. Then the edge f which covers w is not in 7, nor is its other end point x. 
(Why not?) In this case, we can enlarge 7 by adding edges e and f and 
vertices w and x (as inner and outer, respectively). 

Note that all of the foregoing three concepts (an augmenting tree, 
a flowered tree, and a tree which can be enlarged) are related to con- 
sideration of a planted tree and an edge meeting one of its outer vertices. 
They differ with respect to the other end point of the edge under con- 
sideration (whether it is in the tree as an outer vertex or whether it is 
not in the tree, either exposed or covered). Naturally, the same planted 
tree may, with respect to different edges, meet several of these conditions. 
If it does not meet any of these conditions with respect to any edge inci- 
dent with any of its outer vertices, we have the concept of a hungarian 
tree. A planted tree 7 in G@ is called a hungarian tree if every edge of G 
which is incident with an outer vertex of T joins it to an inner vertex of 7’. 
Thus T is not an augmenting tree or a flowered tree relative to any edge e 
of G, nor can it be enlarged as in the preceding paragraph. 

The following theorem is of central importance for the algorithm 
for finding a maximal matching in a graph: 

Theorem 6-10 A planted tree T in G@ can be enlarged to either an 
augmenting tree, a flowered tree, or a hungarian tree. 

To see this, we need only consider the definitions and look for an 
edge e of G relative to which 7 is augmenting or flowered. If there is 
no such edge, and if 7’ cannot be enlarged, then 7’ is necessarily hungarian. 

We are now in a position to describe the algorithm for finding a 
maximal matching. Initially, take any matching M in G and any planted 


A variety of interesting applications 175 


tree T' relative to M. (We can take as M the empty set and as 7 any 
vertex.) 


1. If T is augmenting relative to some edge e of G, use the corre- 
sponding alternating chain to enlarge M to a matching M’ having one 
more edge. If M’ is not perfect, pick a new planted tree 7 (e.g., any 
exposed vertex). 

2. If 7 is flowered relative to some edge e, shrink the vertices and 
edges of the blossom to a single new pseudovertex and remove any loops 
created. 

3. If T can be enlarged by adjoining two new vertices and edges in 
the manner described earlier, enlarge it. 

4. Continue steps 1 to 3 until either a hungarian tree or a maximal 
matching is found (in the transformed graph in which a number of 
blossoms may have been shrunk), 


In the former case, if the hungarian tree 7 contains all vertices of 
the transformed graph G*, then the current matching M is clearly maxi- 
malin G*. If it does not, remove from G* the vertices of T and all edges 
incident with these vertices and repeat steps 1 to 3 for the remaining 
graph. In this manner we gradually remove one hungarian tree after 
another until either we have a perfect matching in the last remaining 
portion of the graph or else the final, remaining portion is hungarian. 

The maximal matchings in these several portions are now used to 
reconstruct, a maximal matching for the original graph by reexpanding 
the previously shrunken blossoms (in reverse order from the order of 
shrinking) and adding appropriate edges to the matching. If the blossom 
has 2n + 1 vertices, then n edges of the blossom are added to the match- 
ing in the one possible way which does not involve an edge incident with 
the tip of the stem (which may already be incident with an edge of the 
matching). 

The algorithm for finding a maximal matching will be illustrated 
with the graph of Fig. 6-32a. Throughout the discussion, the edges 
and root of the current planted tree will be shown boldly, and the edges 
in the current matching will be denoted by asterisks. 

Initially, we take as M the empty set and choose any exposed vertex, 
say, F, to start a planted tree 7. 1 is augmenting relative to edge 13, 
so we add 13 to M and start a new planted tree at C, as shown in Fig. 6-32b. 
Relative to the new 7’, edges 11 and 13 meet the conditions for enlarging 
T, asin (ce). Relative to edge 10, the current planted tree is augmenting. 
We enlarge M, discard 7, and start a new tree at H, as shown in (d). 
By two augmentations (based on edges 6 and 7) we enlarge 7’, as in (e). 
Now 1 is augmenting relative to edge 12. We enlarge M and start a 
new tree at 7, as shown in (f). Two enlargements (based on edges 8 
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and 9) yield the tree shown in (g). This tree is hungarian. Its outer 
vertices (C, H, and J) meet only inner vertices of the tree. We next 
remove the tree temporarily from consideration and consider the sub- 
graph G; obtained by eliminating the vertices of the hungarian tree and 
all edges incident with these vertices. This subgraph is shown in (h). 
Here we have chosen a new root A and enlarged the tree, based on edge 1. 
Relative to edge 2, this tree is augmenting, and it yields a new matching 
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Figure 6-33 


in G, consisting of the two edges shown in (7). This matching is clearly 
maximal (in fact perfect) in G; When combined with the maximal 
matching in the hungarian tree found earlier, it constitutes (for reasons 
discussed later) a maximal matching for G, shown in (j). The present 
example did not require shrinking of any blossoms. 

In more complex graphs, it is necessary to find and successively 
remove several hungarian trees 7, 7, . . . and hence consider several 
residual subgraphs G,, G2, . . . before finally arriving at a subgraph in 
which we find either a perfect matching or else a hungarian tree which 
contains all remaining vertices. By successively restoring the hungarian 
trees in reverse order and assigning a maximal matching to each as it is 
added, a maximal matching for the entire graph is obtained. 

To illustrate the shrinking and subsequent reexpansion of blossoms, 
suppose that, in addition to the edges shown in Fig. 6-32a, vertex C’ was 
incident with the structure shown in Fig. 6-33a. Relative to the planted 
tree and matching edges shown in (a), the circuit C, P, Q, T, N is a blos- 
som; and when shrunken, it yields (b). Relative to the new planted tree 
in (b), the remaining graph is a blossom, and it can be shrunken to the 
single pseudo-pseudovertex shown in (c). Reexpanding blossoms in 
reverse order and recalling the matching edges destroyed in the shrink- 
ings yields the maximal matching shown in (d). 

Assuming that vertex C in our original example was actually the 
above pseudo-pseudovertex (i.e., as- 
suming that this double shrinking 
had preceded the considerations 
shown in Fig. 6-32), a maximal 
matching for the total graph is as 
shown in Fig. 6-34. 

Exercise 6-21 Find maximal match- 


ings for the graphs shown in Figs. 6-15 
and 6-17. 


Thus far we have introduced 
the terminology and preliminary re- 
sults required to describe the algo- Figure 6-34 
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rithm and informally illustrated the algorithm by means of a limited 
example. We shall now restate the algorithm in general terms. 

Statement of the algorithm Let G = (V,EZ) denote the graph 
for which a maximal matching is to be found and let M denote an initial 
matching in G. (The empty set will do.) Let S denote a set of edges, 
initially empty, which will subsequently be enlarged by adding certain 
edges. If any vertices of G are exposed relative to M, select one of them 
and regard it as a planted tree 7. Consider, in any order, the edges 
which are not in 7 but which meet outer vertices of T. If T is flowered 
relative to an edge, shrink the corresponding blossom to a single pseudo- 
vertex, record the blossom involved, and henceforth regard the pseudo- 
vertex as an outer vertex of 7. If 7 can be enlarged by adjoining two 
edges and vertices as described earlier, enlarge it. If 7’ is augmenting 
relative to some edge, use the associated augmenting chain to enlarge M. 
(In this case 7 is no longer a planted tree. Start a new planted tree by 
selecting—if one exists—an exposed vertex relative to M.) 

If a point is reached at which the present planted tree 7 is hungarian, 
and if not all vertices of the graph under consideration are in 7, tem- 
porarily eliminate all vertices of 7 and all edges which meet these vertices. 
Denote by G, the remaining graph, and repeat the above process for 


this graph. 
Ultimately, after removal of a finite number of hungarian trees 
T;, T2, . . . , T, and consideration of the corresponding residual graphs 


G,, Gs, .. . , G, a point such that either (1) @, is empty or (2) M 
includes a perfect matching for G, is necessarily reached. At this point 
restore T,, T,1, . . . , T1, in turn, with a maximal matching for each. 
If any vertices are pseudovertices, reexpand them. (A safe way to do 
this is in the reverse order from their order of shrinking.) As each vertex 
is expanded, assign to the corresponding blossom the maximal matching 
which does not meet the vertex which was the tip of the stem at the 
time of shrinking. After all blossoms have been expanded, the resulting 
matching is maximal for the original graph. A complete validation of 
this algorithm is presented in Ref. 20. 

Let G = (V,E) be a graph each of whose vertices v has an associated 
nonnegative integer d(v). By a degree-constrained subgraph of @ is meant 
a subgraph G’ = (V,/’) such that the degree of v in G’ is at most d(v) 
for every vertex v. By a maximal degree-constrained subgraph is meant 
one having the largest possible number of edges. [The reader will recog- 
nize the maximal matching problem as corresponding to the case when 
d(v) = 1 for every vertex v.] 

Edmonds has developed a straightforward generalization of the 
foregoing algorithm to solve the general problem of finding a maximal 
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degree-constrained subgraph for arbitrary nonnegative d(v)’s. He has 
also considered a second important generalization of the maximal match- 
ing problem. It is that of finding a matching whose total “weight” is 
maximal, where each edge of the graph has an associated numerical 
“weight.” (The basic matching problem corresponds to the case when 
all weights are unity.) It is possible to combine an algorithm’ for solving 
this problem with one for finding shortest paths (as in Chap. 3) to solve 
the Chinese postman’s problem, first studied by Kwan.** It is to find a 
closed edge progression which includes every edge of a connected graph 
at least once, and which has minimal total weight. (Thus the problem 
is to duplicate enough edges to obtain a unicursal graph, adding minimal 
weight in the process.) 


SOME ENGINEERING APPLICATIONS 


6-15 ANALYSIS OF PHYSICAL SYSTEMS 


In this section we shall discuss the way in which an appropriate directed 
graph can be used to assist in deriving essential information about a 
physical system as a whole, given information about the characteristics 
of its component parts and the manner in which they are interconnected. 
The method to be discussed, developed, for example, by Trent,** is per- 
haps most widely applied to electrical circuitry, but it is equally applicable 
to systems involving other manifestations of energy employing, for 
example, translational or rotational mechanical devices or fluid devices. 
In addition to “pure” systems, the technique can be extended to “mixed”’ 
systems in which several energy types occur in various system elements 
and are interrelated through appropriate “coupling” devices. 

Consider a collection of m two-terminal devices or system elements 
By, .. . , Bn whose terminals are joined, in some manner, at n junction 
points P;,..., P,. These might be, for example, a collection of 
resistors, capacitors, and inductors, together with one or more voltage 
generators (in the simplest instance batteries, in other cases time-depend- 
ent devices). It is assumed that each individual system element can be 
adequately characterized by a known equation which relates two funda- 
mental variables, a through variable x; and an across variable y;, which are 
associated with the clement 2; and measured in a specified direction. 
‘The choice of variables and the use of the terms “through” and “across” 
will be clarified shortly. 

For example, if 2; and y; denote current and potential difference, 
respectively, each passive element (element other than a generator) may 
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have a characterizing or constitutive equation of the form 


th = ka; resistor 
d ; 
wW=k a a; inductor 


t * 
W=k f. _ ti dt capacitor 


where ¢ corresponds to time. An active element, or generator, is char- 
acterized by an equation expressing just one of the basic variables as a 
(possibly constant) function of time. For example, 


yi = FO 


characterizes a voltage generator. 

Assume now that an are a; is associated with each element E, and 
that a vertex v, is associated with each junction point P;. If the end 
points of the arcs are taken to be the appropriate junction points, the 
resulting directed graph provides a convenient characterization of the 
structure of the corresponding physical system. The essential charac- 
teristic of the through variables, in the present context, is that they 
satisfy the following postulate at each vertex: 

Vertex postulate The algebraic sum of the through variables 
associated with the ares incident with any given vertex is zero. 


By algebraic sum is meant the following: each through variable is 
added or subtracted, depending on whether the corresponding are is 
positively or negatively incident with the vertex under consideration. 
In Figure 6-35, for example, the postulate is satisfied at v;, since 


2) NCA 3 an) 


and it is readily seen that the postulate is satisfied at the other vertices 
as well. In the context of electrical networks, this is Kirchhoff’s current, 
law. In general, one must choose as one basic variable, namely the 
through variable, one with appropriate physical dimensions so that the 
vertex postulate is satisfied. 

The across variables are also assumed to satisfy a basic postulate, 
in this case related to circuits rather 
than to vertices. 

Cireuit postulate The alge- 
braie sum of across variables associ- 
ated with the ares of any simple cir- 


% cuit is zero. 


In this case the circuit is as- 
Figure 6-35 sumed to be oriented (in either of the 
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Figure 6-36 Figure 6-37 


possible ways) and each across variable is added or subtracted, depending 
on whether the corresponding are has a direction which agrees or disagrees 
with the induced orientation. In Fig. 6-36, for example, the postulate 
is satisfied for the oriented simple circuit C, since 


EV Scent ie OY lt 


It can also be seen to be satisfied for each of the five other simple circuits 
of this particular graph. For electrical networks, the reader will recog- 
nize Kirchhoff’s voltage law. 

The circuit law has the following alternative formulation: If » is 
a fixed vertex and »; is any other vertex, then the algebraic sum of across 
variables associated with any chain oriented from »; to v; is independent 
of the particular chain selected. (It is assumed here that the graph is 
connected, so that at least one such chain exists.) Using this formula- 
tion, we can associate a quantity S; with each vertex »; as follows: Assign 
S, arbitrarily, and for other j set S; = S; — K, where K is the algebraic 
sum of across variables along any chain oriented from v, to v;. By setting 
S, = 3 in the preceding example, we obtain the values of S; shown in 
Fig. 6-37. The across variables determine the values of S; to within an 
additive constant, and one may choose any convenient reference vertex. 
In an electrical context, the values of S; can be considered as potentials 
relative to the reference potential established at the reference vertex. 
The values of the across variables are then seen to be differences in 
potential. 

The process for deriving equations which characterize the system 
as a whole from the constitutive equations of its elements, together with 
the system structure, proceeds in two stages. First, the vertex and 
circuit postulates are exploited to reduce the totality of through or across 
variables to a (relatively) small set of independent variables in terms 
of which all the variables can be expressed. The constitutive equations 
are then brought into play to interrelate through variables with across 
variables. We now turn to the first stage of this process. 
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When applied to a particular vertex v;, the vertex postulate asserts 
that 


m 
Y aya; = 0 
i=1 


where a;; = +1(—1) if the ith vertex is positively (negatively) incident 
with the jth are and is zero otherwise. Expressed differently, the vectors 
Ai = (ai, - - « , im) 

and Ke ee Gye Fx 5 Sma) 

are orthogonal. Note that A; is a row of the graph’s incidence matrix A. 
Since the space spanned by the rows of A js the same as the space spanned 
by the rows of the cut-set matrix K, it follows that X’ is a linear combina- 
tion of circuit vectors (rows of the circuit matrix C) since A (or K) and C 
determine orthogonal subspaces which jointly span m-dimensional space. 

Referring to material from Chap. 5, K and C ean be written as 


RK = (Kil) 
and é = (16,2) 
where the J’s are identity matrices, by choosing as the first m — n + 1 


columns the chords of a spanning tree. By partitioning the vector of 
through variables in the same way, we have 


Xe 
x= (x) 


where X, and X, correspond to the chords and branches, respectively. 
The vertex postulate then implies that 


vA x 
(Rull) ($2) = 0 
Hence X= —KuX, = CX. 


and the branch through variables have been expressed in terms of the 


chord through variables. 
In a similar way, the circuit postulate leads to the matrix equations 


(1G) (y) -0 
and Y. = —CnY, = KiY, 


expressing the chord across variables in terms of the branch across 
variables. 

Application of these relationships constitutes the first stage of the 
analysis: We have reduced to a minimum the number of through and 


A variety of interesting applications 183 


across variables which must be explicitly considered. (The particular 
variables selected depend, of course, on the selection of a spanning tree.) 

The constitutive equations of the elements can be conveniently writ- 
ten in matrix form as follows if the across variables are explicit functions 
of the through variables: 


Y=0,.X— ¥, 


where 2, isan m X m diagonal matrix whose ith diagonal element is either 
a constant or a differentiation or integration operator and Y, is a column 
vector which is zero in positions corresponding to passive elements and is 
an appropriate function f() for generators. (The corresponding diagonal 
entries of ©, are 0.) 

If the through variables are expressed as explicit functions of the 
across variables, we obtain a comparable expression 


X=9,Y-—X, 
The former expression can be rewritten as 
aOx.=Y + Y, 
Premultiplying by € yields 
EVEX, = (CY + CY, = CY, (6-1) 


in which the only unknowns are the chord through variables. 
The latter expression can be rewritten as 


0,K'Y, = X + X, 
and then as 
RORY, = RX RX, = RY, (6-2) 


in which the only unknowns are the branch across variables. 

Equations (6-1) and (6-2) correspond to the circuit (or mesh) and 
vertex (or nodal) formulation techniques, respectively. In either case, 
if the resulting set of simultaneous equations can be solved by using 
appropriate mathematical tools, the remaining through and across vari- 
ables are readily obtained from relationships given above. Note in par- 
ticular that K’ and C’ can be written down by inspection from the graph, 
once a tree has been chosen. 

When some system elements have more than two terminals, or when 
some elements serve as “couplers” relating different energy types in the 
same system, the matrix characterizing the constitutive equations has a 
more complex structure and the resulting simultaneous equations are cor- 
respondingly more difficult to solve. The role of the system graph is 
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fundamentally the same, however. For extensive discussions of this 
topic, from various points of view and with varying degrees of emphasis 
on the system graph, the reader is referred to the following books listed at 
the end of Chap. 1: 14, 15, 17, 22, and 27. 


6-16 COMMUNICATION NETWORKS 


We first consider some qualitative aspects of communications between 
members of a group. The members of a social group can communicate by 
several means, e.g., orally, by writing, or by gesticulation. These means 
of expression when used by members of the group for communication give 
rise to a network for that group. The network in this case is a graph 
whose vertices represent the members of the group and whose edges 
(called communication channels) indicate whether two members of the 
group are in a position to communicate directly. 

The directed graph of Fig. 6-38 indicates whether individuals or sta- 
tions, represented by vertices, can communicate with each other, the 
arrows characterizing directions in which messages can be sent. The 
vertex matrix of the graph is given by 


Vv; Ve Vs Us V5 

vy JO J 08 0 O 
3,020 Ws TO 
V=u10 0 0 0 0 
VEO” (Oe 8OF 

ve \0 0° 0 10) 0 


Exercise 6-22 Answer the following questions from the matrix. 

(a) With how many stations does each station communicate directly? 

(b) How many stations communicate directly with each station? 

(c) In how many ways does each station communicate with any other through 


one intermediate stat’on? 
(d) Can every station communicate directly or indirectly with every other station? 


In a communication network assume that a subset of communica- 
tions (direct through ares or indirect through paths) between the vertices 
is prescribed as a necessary condition for the performance of a task. It 
may then be desired to determine 
whether it is possible to carry out 
% Dy the task by means of the given net- 

work. Generally it may be possible 

to effect the communications in sev- 

vs eral ways. Each such way is known 

“8 as an organization. An optimal or- 
Figure 6-38 ganization may be chosen according 


we 
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to some criterion, e.g., minimization of the global cost of operation using 
the organization. 

Remark: An interesting measure defined on a strongly connected 
graph is the centrality index, which is a measure of the disparity between 
the vertices of the graph. If we define a deviation matrix whose coeffi- 
cients mj; give the minimum length of a path from vertex v; to vertex v; 

n 


and if we form y m4, then the centrality index of vertex v; is given by 


j=l 
" 
% may 


igs 


y mi 
j=l 
The global index of the graph is obtained by summing over ¢. 


Exercise 6-23 Compute the centrality indices and the global index of several 
graphs. Compare with the concept of radius. 


Synthesis of a Communication Network 


We now consider questions in which the quantity of communications 
is important. In a communication network (a connected simple graph) 
one may prescribe a numerical capacity on each edge to reflect the fact 
that the total communication load between any two points (cities, for 
example, in telephone communication) may be limited. In this manner 
we obtain a vertex-by-vertex matrix B, known as the branch capacity 
matrix, whichis asymmetric matrix. Each element of B gives the capacity 
of the edge incident with the two vertices corresponding to the entry. 
Every main diagonal element is assigned the same constant value d, which 
is left arbitrary. From B one may determine a terminal capacity matrix 
T, another symmetric matrix giving the maximum possible communica- 
tion flow between every pair of vertices. Its entries are obtained by 
taking all the cut-sets separating the two vertices regarded as terminals, 
computing the sum of the capacities of the edges in each cut-set, and 
taking the minimum of these capacities. 

The present discussion anticipates certain results which are rigorously 
established in Chap. 7. In particular, it is shown there that the max- 
imum feasible flow between two specified vertices, of a directed graph, is 
equal to the capacity of a minimal-capacity cut-set separating these 
vertices. Moreover, a systematic method for locating such a critical cut- 
set is presented. Thus B gives rise to a unique 7’, One question is to 
determine when a symmetric matrix is realizable as a terminal capacity 
matrix 7. Another is to obtain in a systematic manner a B from a given 
T (there can be several B’s) such that the sum of the branch capacities is 
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as low as possible. This is known as the systematic synthesis of a com- 
munication network having given terminal characteristics. We now dis- 
cuss this realizability problem. 

Consider the cut-set corresponding to the minimum entry f; in 7. 
This cut-set divides the graph into two components C, Cz and enables 


the partitioning 
[= T., T(t) 
Tb)! eT, 


where 7’,, and 7’, are the terminal-capacity matrices for C; and C2, and 
T(t) and its transpose 7’(t:) give the entry (everywhere ¢,) indicating 
connection between pairs of vertices, one in each component (with a value 
t, the capacity of the cut-set). 

We describe the following process given’ as a necessary and suffi- 
cient condition for realizability. Rearrange the matrix as 7’ above, 
where ¢;; > ¢; and 7, and T,, are terminal-capacity matrices; i.e., two 
graphs whose cut-sets have the capacities indicated by these matrices can 
be constructed and then joined by the minimal-capacity ecut-set. By 
appropriate rearrangement of rows and corresponding columns, 7'., and 
T., can each be brought into the same structural form as T. Hence each 
can be further partitioned into four submatrices (unless one is already a 
single element). The process of rearrangement and partitioning can be 
repeated until finally the diagonal submatrices of the partitioned 7’ con- 
sist only of single elements and/or 2 X 2 symmetric matrices. 

Chien gives the following simplified rule: At each partition stage the 
capacity of each edge which is connected to the subgraph to be partitioned 
is divided equally between the two partitioned subgraphs. The capacity 
of the new edge is the difference of the new terminal capacity and one-half 
the original branch capacity between all other subgraphs and the subgraph 
to be partitioned. 

In the synthesis problem, the sum of the unknown capacities of edges 
in the branch capacity matrix which correspond to the minimal-capacity 
cut-set is equated to the capacity of the cut-set. An arbitrary 0 or 1 
assignment is made to each of the entries consistent with the equation. 
Again by checking all possible cut-sets separating vertices in the parti- 
tions, the remaining edge capacities are determined. Chien gives a 
procedure for finding the total are capacity without synthesis of B. 


6-17 SIGNAL- FLOW GRAPHS 


Here we introduce general concepts of signal flow in a directed graph or 
network that has sources [6-(v) = 0] and sinks [6+(v) = 0] and possibly 
cycles and loops. The presence of cycles and loops introduces the ideas of 
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feedback in a network. In addition to quantities called gains associated 
with the arcs, we have the notion of a signal 2; transmitted from vertex v;. 
The quantity 2; is called the weight of v;. How the signals are combined 
with are gains to derive expressions for the total flow from sources to sinks 
(often called the gain at the sink) is the object of the analysis of such 
networks, The relationship between the signals at the different vertices 
may be left in general functional form or specialized to linear relations. 
In the latter case, by introducing appropriate operations on the network, 
it is possible to establish a correspondence between these operations and, 
for example, the solution of a set of simultaneous linear equations. 

The network itself may be the representation of a physical system. 
One may go directly from the system to the equations, or since it is often 
helpful in understanding the contribution of the various components of 
the system to the signal flow, one may represent the system by a network 
and, because of the correspondence with linear systems, attempt to deter- 
mine the contribution of the network components to the flow. We begin 
by introducing network concepts which are pertinent to this type of 
analysis, although in this brief account none of them will receive great 
emphasis. 

The arcs of a network may be divided into two classes: (1) feedback 
arcs, i.e., those which belong to cycles or which are loops, and (2) cascade 
or nonfeedback ares.® In Fig. 6-39 vws, vsve, vavs are the feedback arcs 
and v2, V4, Vas are the cascade arcs. The vertices may also be classified 
in the same manner depending on whether they belong to cycles or not. 
A cascade network is one in which every arc is a cascade arc. The ver- 
tices of such a network may be so labeled that the subscripts of vertices on 
every path are in increasing order. Thus one begins by labeling a source 
vertex (or vertices); then if such a vertex is removed with its connecting 
ares, one has a new network with at least one source. This new source is 
labeled v2 and removed with its ares, etc., until isolated vertices which are 
the sinks of the original graph are reached and are labeled last. The 
labeling need not be unique. In a feedback network there is at least one 
feedback vertex. A feedback unit of the network is a subgraph consisting 
of only feedback arcs and vertices. The feedback units of Fig. 6-39 are 
given in Fig. 6-40. 
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Figure 6-41 


A vertex of a feedback unit may be split into two vertices, one of 
which is incident with only the outgoing ares of the original vertex and the 
other with the incoming ares. In this manner all feedback ares incident 
with the original vertex become cascade arcs of the new network. In 
Fig. 6-41 we give a feedback unit on the left, splitting vertex v2 on the 
right. The index of a feedback unit is the minimum number of vertices 
which must be split to transform all ares to cascade ares (i.e., intercept all 
feedback cycles). The splitting operation is essentially a simplification 
of the feedback network in order to compute the total flow without the 
effect of cycles. Once index vertices have been determined (there may 
be several collections with the same number of vertices that may produce 
the desired effect), a residual network is formed from these vertices and 
the sources and sinks, All other vertices are eliminated. Arcs between 
two vertices, and loops in the residual network, correspond to (1) ares 
incident with these vertices in the original network, (2) paths connecting 
these vertices in the original network but passing through only non- 
residual vertices, and (3) loops which may be original loops or else cycles 
between a residual and a nonresidual vertex. 

We again illustrate a network and its residual where we mark source 
and sink with X (Fig. 6-42). Here the index vertices are v3 and v,. A 
network may be condensed by shrinking each complete feedback unit to 
a vertex. It is then joined to the other vertices as indicated by the cas- 
cade ares. If more than one such are was present between the vertices of 
a unit and another vertex or another unit, they are all represented by a 
single are. 

As a final concept we mention path inversion in a network, whereby 
all the arcs of a path are reoriented in the opposite direction. However, 
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there is an additional requirement 0 MENA 
that all ares originally incident with . 

any are of the path at its terminal \ yy 
vertex are disconnected from that 


vertex as their terminal vertex and Fi 

é igure 6-43 
are connected to the new terminal 
vertex (the original initial vertex) of the are. This is done for all ares of 
the path. The operation of are inversion changes the incidence rela- 
tionships of the graph. 

We now introduce algebraic operations associated with the foregoing 
concepts. In a cascade network the weight at a vertex is a function of all 
the weights that are associated with initial vertices of ares which terminate 
at the given vertex. These weights presumably undergo transformations 
through the ares, determined by the are gains. We may write, for example, 

wy = fi(wi,ee) 
indicating that the vertex v; is the terminal vertex of two ares starting at 
v; and v,. Now a; itself has a similar functional relationship with other 
vertices, and so has zy, ete. Substituting these relationships in f; relates 
2, to the weights at still earlier vertices, ete., back to the source vertices 
by use of substitution and the process of elimination of intermediate 


vertices. This type of simplification is not possible in a feedback net- 
work. For example, in Fig. 6-43 let 2; be a given weight at »; and let 


to = folwra3) ts = fa(x1,22,08) 
Substitution gives 
x9 = fo(as,fo(e1,0s),€2) = F3(a1,22) 


and cannot be explicitly simplified. 

Now we turn to linear systems in which the f’s are linear in the 2’s. 
Here we assume that the signal received at a vertex is the sum of the 
products of are gains and the weights at their initial vertices of those ares 
which terminate at the given vertex. Thus from Fig. 6-44a we have 


Xe = Cit + Cote 


Figure 6-44 
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Also, from Fig. 6-44b we have 


Le = Coe To = CX 
and hence —.&3 = ,C9ty 


cy Cs that is, the are gains multiply when 
the arcs are in series and may be re- 
placed by a single are between »; and 
v3 with a gain of c,cx. The are gains 
add when the ares are in parallel be- 
Figure 6-45 tween two vertices, and they may be 
replaced by a single are with the sum 

of their gains between the two vertices. 

These operations may be applied to a network to determine gains for 
the ares of the residual network, whose index gives the minimum number 
of variables which cannot be eliminated by explicit operations. Each 
feedback unit corresponds to the solution of a set of simultaneous equa- 
tions in the weights of its vertices, and each arc in the condensation graph 
corresponds to the elimination of a variable in terms of the variables of 
the initial vertices. 

The effect of loops and cycles is studied by reducing the network to 
the residual network and assigning the arcs the appropriate gains accord- 
ing to the foregoing rules. Note that in the residual network, a loop may 
be obtained as the result of combining an outgoing and incoming are and 
hence must have a gain corresponding to the product of their gains. Let 
v be a vertex with weight x and with a loop whose gain is c. Since the 
total signal entering v must yield x and since the loop introduces at v a 
signal cx, the signal from other ares terminating at v must be (1 — e)a so 
that the sum would be z. In that case we may replace the effect of the 
loop by the effect of an arc of capacity 1/(1 — c). Thus, for example, the 
weight at vs in the network of Fig, 6-45 takes the form 
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Ly = Coty + I Eat 


c 
Similar reductions can be accounted for in the case of several loops. 
Exercise 6-24 Show that the weight at v, in the network of Fig. 6-46 is given by 


i ab/(1 — g) +ed/(1 — h) + (afd + ceb)/(1 — g)( — h) 
1 = ef/(1 — g)(1 — A) 


Note that in order for these network operations to correspond to the 
solution of a system of simultaneous linear equations, one may formulate 
the equations of a signal-flow network as shown below and perform oper- 
ations on the network corresponding to the operations used to solve the 
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Figure 6-46 


equations. Consider the network of Fig. 6-47 with its equations, where 
2, is given and 

@y = 4a, — a 

te = % 
6x3 — Qa 


iS 
tl 


If we wish to determine the gain at 2x3, for example, we have by elimination: 
ts = Fay 


On the other hand, we can also carry out the reduction by forming the 
residual graph shown in Fig. 6-48 by using v; as the index vertex. Since 
it is desired to have v, as the sink, we adjoin an artificial vertex v, with the 
transformation along the are v3v, equal to unity, i.e., that are has unit 
gain, so that both vertices have weight «3. The two parallel arcs vy; 
yield an arc of gain 6. When the loop whose gain is —6 is replaced by an 
are of gain 1/[1 — (—6)] and the results are combined to obtain the net 
gain at v;, we have as = $x. 

Actually, the elimination process and the graph reduction may be 
made to correspond in their steps. For example, if x2 is replaced every- 
where by 2), the graph is reduced to another graph in which the gains of 
arcs passing through x are multiplied by the unit gain of the are vy», ete, 
In this manner we see the correspondence between the manipulation 


6 -6 
4 7 ‘ 4 1 
(a % a vg 
z 4 3 
: 1 : 2 


Figure 6-47 Figure 6-18 


192 Applications 


% of signal-flow networks and the 
Zs solution of linear equations. 
% 
% vg 6-18 SWITCHING NETWORKS 
zg 27 Consider a loop-free graph 
Figure 6-49 G = (V,E) 


each of whose edges e; has an 
associated variable x; which can assume only the values 0 and 1. Sucha 
graph may be considered as a mathematical model of a set of intercon- 
nected physical devices, such as switches, each of which can be in either of 
two states: on (x; = 1) or off (a; = 0). Let v1 and v2 be two fixed, distinct 
vertices of G. The graph, together with the switching variables |xi}, is 
called a switching network, and v; and v2 are regarded as its terminals. 

If there are n edges (i.e., switches) and if X = (ar... . ,@,) isa 
particular assignment of values to the switching variables, the network is 
considered to be closed relative to X if and only if the set of edges for which 
x; = 1 contains a simple chain joining v; and v2. Otherwise, the network 
is said to be open relative to X. (Alternatively, the network is closed 
relative to X if and only if v; and v2 lie in the same component of the sub- 
graph determined by those edges for which x; = 1.) 

Consider, for example, the switching network of Fig. 6-49. The sets 
of switching variables corresponding to simple chains joining v; and v2 are 
(in terms of their subscripts) 


(1,4,5), (1,4,6,7), (1,3,6,5), (1,3,7), (2,7), 2,6,5), (2,3,4,5) 


The vectors X = (x1,02, . . . ,£7) for which the network is closed are 

precisely those vectors which have a 1 in each position corresponding to 

one of these simple chains and arbitrary values in the remaining positions. 
Given a switching network N with n switches, the switching function 

f(X) associated with N is the function defined on the 2" possible values of 

X as follows: 

if N is closed relative to X 


1 
FAY Vo, — otherwise 


Thus far we have implicitly assumed that 2; is independent of 2; 
whenever 7 # j, that is, that the n switches are controlled independently 
of one another. If this is not the case, not all 2” values of X are permis- 
sible. For instance, suppose that 


t= 2 and %=f=1—xm 


in the preceding example. Then the chain determined by subscripts 
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(2,6,5) is never “closed,” since v2 and xs cannot both be 1. Similarly, the 
chain (1,3,7) is closed whenever 


m= %=1 
since then we necessarily also have 
tz =1 


The following general problem arises: Given m independent switching 
variables @, . . . , %m and a specified switching function 


S(X) = fle, «+ + sm) 


under what conditions can a switching network which realizes f(X) be 
found? Any switching function can be realized by a suff ciently large net- 
work each of whose switching variables is equal to one of the m independent 
variables or to its 1’scomplement. For example, ifm = 3 andif f(X) = 1 
for these values of X: 


Ty Le 73 
x 1 0 1 
X, 1 1 0 
Xs 0 1 1 
xX, 0 0 1 


the network of Fig. 6-50 clearly realizes this switching function. Unfor- 
tunately, the high degree of redundancy occurring in this type of construc- 
tion is generally not acceptable. One normally wishes to use as few 
switches as possible: hopefully only m. In the above example, where 
m = 3, there are only three distinct configurations of interest, shown in 
Fig. 6-51. These do not provide sufficient variety of structure to realize 
all 28 possible switching functions which may be defined on three inde- 
pendent switching variables. 

It is possible to use concepts from graph theory to realize specific 
switching functions which use a relatively small number of switches. In 


Figure 6-50 
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Ref. 64, for example, the properties of fundamental circuit and cut-set 
matrices, discussed in Chap. 5, are brought to bear on this problem, 
The question of the realizability of a switching function is shown to be 
related to the problem of whether a given matrix is the circuit matrix of 
an appropriate graph. 


6-19 POWER STATION INTERCONNECTION 


Suppose that a set of power stations »; (¢ = 1, . . . , n) wish to be inter- 
connected with each other so as to be capable of sharing each other’s power 
at times of extra need. Depending on the size and potential need of the 
various stations, suppose that station v; (i= 1, ... , n) desires to be 
directly linked to at least d; other stations. For distinct v; and v; the cost 
of directly linking these stations is a real number ¢j;. 

The “interconnection” problem is to design a system of interconnec- 
tions having minimum total cost which satisfies the demands d; of the 
various stations. This is, of course, equivalent to the following graph 
problem: Given a complete graph with vertices »; (i = 1, . . . , n) and 
with weight c;; attached to edge e;; joining v; and »;, find a subgraph which 
has minimum weight-sum of edges and which has degree at least d; at 
vertex v;. Note that this in turn is equivalent to finding a subgraph 
having maximal weight-sum, each vertex having degree at most n — d; — 1, 
and then taking the complementary graph. When viewed in this way, 
the problem is seen to involve both generalizations of the maximal match- 
ing problem discussed at the end of Sec. 6-14. 

In the special case where the vertices partition into two classes so that 
every edge joining a pair of vertices in the same class has weight equal to 
zero, the interconnection problem is the same as the transportation prob- 
lem. In general, its theory is much more complicated than in the trans- 
portation case. J. Edmonds has found a good algorithm for it (not yet 
published). 

The interconnection problem, or general optimum degree-con- 
strained subgraph problem, essentially includes most of the combinatorial 
extremum problems which are known to have a good algorithm in the 
sense that the work involved in applying the algorithm increases only 
algebraically with the size of the problem. In the case where all d; = 2, 
interconnection comes rather close to the traveling salesman problem. 
In fact, a slight modification to the interconnection algorithm will yield a 
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minimum weighted subgraph with degree exactly 2 at each vertex. If 
that subgraph is connected, it will be a minimum traveling salesman 
route for distances ¢;;. Unfortunately, the problem obtained by adding 
to the interconnection problem the further constraint that the subgraph 
be connected is still unsolved. 


6-20 PRINTED CIRCUITS 


A printed circuit is an electrical network which is formed by printing the 
connecting wires on one or more plane surfaces of a nonconducting mate- 
rial. Most commonly a plastic plate is printed on both its surfaces, and 
appropriate connections are made between the two sides by means of 
connecting wires passing through holes in the plate. Since the printed 
wires are not insulated, no two of them must cross except where a junction 
is intended. 

It is clear that such a network can be printed on a single plane surface 
if and only if it corresponds to a planar graph. The possibility of printing 
an arbitrary nonplanar network on two plane surfaces, with interconnec- 
tions between them, depends on the nature of the interconnections which 
are permitted. Suppose that the interconnections may occur only at the 
original junction points (vertices) of the network. In other words, sup- 
pose that all vertices are printed on both sides, opposite each other, and 
each corresponding pair is joined, through the plate, by a conducting wire. 
We may print some lines on one surface, and some on the other, but we 
may not create additional holes in the plate. 

With this restriction, not every network can be printed on the two 
sides of a plate. We saw in Chap. 4 that a simple graph G and its com- 
plement were not both planar if G had nine or more vertices. Thus we 
could not print the complete (simple) graph having nine vertices, since 
to do so would imply that some subgraph of G (the part of the network 
printed on one side) and its complement (the part printed on the other 
side) were planar. 

If we are permitted to create additional holes and pass conductors 
through the plate at will, any network can be printed. We could, for 
example, start by drawing the network in a plane in such a way that at 
most two wires intersect at any point which is not an intended junction. 
We could then print the entire network in one plane, except that at each 
such intersection one of the wires would be brought to the other plane 
through two holes and a short segment of this wire would be printed on 
the other plane. The material in Sec. 6-7 establishes bounds on the 
number of wire intersections which must be treated in this way. 

Assume now that it is considered desirable to print a given network 
in such a way that the fewest possible additional holes (other than those 
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at the original vertices) are required. Any particular way of printing the 
circuit can be depicted by drawing the network in a plane and using two 
types of ares, say, bold and light, to distinguish the edges printed on one 
side from those printed on the other. The problem can now be reformu- 
lated as that of inserting the fewest new vertices of degree 2 (correspond- 
ing to points at which a line is brought from one side of the plate to the 
other) and drawing the graph in such a way that at most two edges cross 
at any point other than a vertex and that, at such a point, the edges are 
of opposite types. Naturally, an edge is required to be entirely bold or 
entirely light. 


PHYSICAL SCIENCES APPLICATIONS 


6-21 CHEMICAL IDENTIFICATION 


A fundamental unsolved problem in graph theory is to find a good algo- 
rithm for deciding whether two finite graphs are abstractly the same or, 
more generally, for deciding whether one graph is a subgraph of the other. 
The first task is called graph identification, and the second task is called 
subgraph identification. For the identification problem to be, strictly 
speaking, mathematical, “good” must be given some mathematical mean- 
ing. Letussay that, for an algorithm which deals with any pair of graphs, 
“good” means that the amount of work involved in applying the algo- 
rithm increases at most algebraically, rather than exponentially, with the 
number of edges in the pair of graphs to which the algorithm is applied. 

J. Edmonds has constructed a good algorithm for tree identification 
(not subtree identification) which we shall describe. Ee conjectures that 
there is no good algorithm for general subgraph identification or even 
general graph identification. From the practical viewpoint, graph and 
subgraph identification are important in organic chemistry, where mol- 
ecules are represented as graphs. Vertices represent atoms, and edges 
represent the bonds between atoms. The fact that there are several 
kinds of atoms adds no serious complication to the identification problem. 
Considerable scientific attention is being devoted to the design of “heuris- 
tic” algorithms for chemical identification—algorithms which are not 
good in our sense but which usually work well in practice. More than 
recognizing whether two chemical structures are the same or whether one 
is a piece of the other, chemists are interested in having some sensible 
system for cataloging chemicals so that a given chemical can be easily 
found in a catalog or perhaps added to it. (Chemical patent searching at 
present is not a very pleasant task.) 

The algorithm for tree identification involves a cataloging system 
which determines a certain linear ordering of the set of all finite trees. 
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However, an identification algorithm may not present a cataloging pro- 
cedure. For example, there is a good algorithm for deciding whether any 
two surface maps are combinatorially the same. Maps have a kind of 
combinatorial “rigidity,’’ so that if there is an identification between map 
M, and M, where edge e; in M;, identifies with edge es in M2, end point 
v, of e; identifies with end point vs of e:, and face f; containing ¢; identifies 
with face f, containing es, then the identification of the rest of the parts of 
M, with parts of M/, is uniquely and easily determined. Therefore, to 
see if a map M, is identical to a map M», one need only try to identify 
edge ¢; of M, with each edge of M» in four ways, one for each possible way 
of identifying vertex v; and face f,; in Mz. This procedure does not 
immediately suggest a straightforward cataloging system for maps. 

For trees 7’, and 7s, each drawn in some particular way in the plane, 
the same “rigidity”? phenomenon can be used to determine easily whether 
1, and 7 are identical and drawn identically. By “drawn identically” 
we mean that for each pair of mutually identified vertices, 7; in T, and v2 
in Ts, the edges which meet 7; are in the drawing of 7, arranged around 
v; in the same order that the identical edges of 7, are in the drawing of 
T. arranged around v», The difficulty here is that different drawings of 
an identical 7 and 7's provide not much help in verifying identity and the 
number of different ways to draw a tree generally increases exponentially 
with the number of edges in the tree. 

One may (though we shall not explicitly carry through this line of 
thought) think of the tree identification algorithm as a procedure for 
assigning to every tree a canonical way to draw it. 

We recall that a rooted tree is a tree with one of its vertices distin- 
guished from the others by being called the root. The identification 
algorithm actually applies to rooted trees. Therefore, we first describe a 
good algorithm for assigning a canonical rooting to any unrooted tree 7. 
Let 7» be 7. Until a 7; which consists of only a single vertex or single 
edge and its end points is reached, obtain subtree 74: of T; by deleting all 
vertices of degree 1 from 7; and the edges which they meet. For any 
tree 7' this process stops with either a unique vertex or a unique edge of T 
which is called the center of T. 

In the case of a vertex, let that vertex be the root of 7’. In the case 
of an edge, let one of the end points of that edge be the root of 7—in 
particular, the one which results in the lower ranked rooted tree (either 
end point in case they result in identical rooted trees). We shall rank all 
rooted trees so that for any two which are not identical one will have 
lower rank. (Another system for assigning a canonical rooting to every 
tree 7 is to choose as root one of the vertices which results in the lowest 
ranked rooted tree. The vertex chosen is not uniquely defined, but the 
resulting rooted tree is. Whichever method for rooting is used, two trees 
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are identical if and only if their corresponding rooted trees are identical. 
The first method seems easier.) 

To every rooted tree will correspond a certain finite sequence of 
integers. No two different rooted trees will have the same sequence. 
The algorithm for deciding whether two rooted trees are identical is to 
compute the corresponding sequences and compare them. ‘There is a 
good algorithm for comparing the sequences, since both the number of 
terms in a sequence and the greatest term in the sequence are equal to the 
number of nodes in the tree the sequence represents. It will be evident 
from the definition that there is a good algorithm for computing a sequence 
from its tree. 

Rooted trees are ranked according to the lexicographical ordering of 
their sequences. That is, 7, ranks less than 7, when there is an integer 
k such that for 7 < k the jth terms of the two sequences are equal and such 
that either there is a kth term in the 7’, sequence and no kth term in the 
7, sequence or the kth term in the 7; sequence is less than the kth term 
in the 7’, sequence. 

When its root r and the edges which meet r are deleted from a rooted 
tree 7, what remains is several trees, one for each edge meeting r in T. 
These trees are called the factors of rooted tree T. Each factor of T con- 
tains exactly one vertex which in 7’ is joined tor. This vertex is regarded 
as the root of the factor. Thus every rooted tree T which is not merely a 
single vertex decomposes uniquely into one or more factors which are 
smaller rooted trees. 

A rooted tree 7 which is merely a single vertex has as its sequence a 
single term equal to 1. For every other rooted tree 7’, all but the first 
term of the sequence S representing 7’ is obtained from the sequences S; 
representing the factors 7; of 7’ by arranging the S;’s one after the other 
in order of ascending rank. Of course, in the case of some identical S,’s 
it does not matter which of them comes first, but all are separately there 
in S. Thus all the terms of all the S,’s comprise all the terms of S except 
the first. The first term of S is 1 plus the sum of the first terms of the 
S,’s, that is, the first term of S is the number of vertices in 7. 

Notice that there is a unique 1-1 correspondence between the terms 
of S and the vertices of 7 such that the first term of S corresponds to the 
root of 7’ and the other terms in S correspond to the same vertices as the 
terms in the S,’s from which they arose. Each vertex v in 7 is the root 
of exactly one rooted tree, call it 7',, which is involved in the inductive 
definition of S. In other words, 7, is a factor of a factor... of a 
factor of 7. The magnitude of the term in S which corresponds to v 
equals the number of vertices in 7’,. 

The crucial feature of the sequence S for purposes of rooted tree 
identification is the uniqueness of the ordering of its terms even though 
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tree T has no a priori ordering of its vertices. It is evident from the defi- 
nition of S that in order for two trees to be identical it is necessary that 
the sequences representing them be identical. 

It remains to be verified that in order for two trees to be identical 
it is sufficient that the sequences representing them be identical. We 
simply verify that the following uniquely determined construction for 
obtaining a tree 7’ from the sequence S which represents it is valid. 

If S consists of a single term, then the term equals 1 and 7’ consists 
of a single vertex. Otherwise, discarding the first term of S, the rest 
of S partitions into disjoint subsequences S; (¢ = 1, . . . , n) of consecu- 
tive terms in S such that, where wu; denotes the first term of S,, u; is the 
second term of S and wu; (7 > 1) is the earliest term in S after w—, which 
is as large as wi. There is no term after uv, which is as large as u,. The 
S,’s thus defined are the sequences which represent the factors 7’; of 7. 
This follows from two facts: (1) The first term of a sequence representing 
a rooted tree is greater than all its other terms. (2) The sequences 
representing the factors of T are arranged in S in order of increasing rank. 

After constructing the 7's represented by the S,’s, rooted tree 7’ 
is constructed from the 7’’s by joining an additional vertex, the root of 7, 
to the root of each 7;. Induction on the size of the sequence completes 
the description of how to construct a rooted tree from the sequence 
which represents it. 

The above method for tree identification seems not to offer much 
help toward a good algorithm for deciding whether one given rooted tree 
contains a subgraph identical to another given rooted tree so that roots 
identify. It would be very interesting to have a good algorithm for this 
more general identification task. 


Exercise 6-25 Show that the sequences obtained by deleting all terms equal to 1 
from the sequences S, representing rooted trees, are just as adequate as the sequences S 
for purposes of tree identification. 


6-22 A SIMPLE APPLICATION TO ORGANIC CHEMISTRY 


Some organic molecules may be represented as planar graphs with the 
atoms as vertices and the bonds between the atoms as edges. The 
simplest such molecules are hydrocarbons of the paraffin series, C242, 
where there are k carbon atoms considered to be vertices of degree 4 
(ineluding bonds to both hydrogen and other carbons) and 2k + 2 hydro- 
gen atoms considered to be vertices of degree 1. In the figures which 
follow, the hydrogen atoms will not be displayed as vertices, since they 
make no contribution to isomerism (i.e., to different types of bonds in a 
molecule with the same number of atoms). 

The total number of vertices » will then be 3k + 2, and the total 
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(butane) (pentane) (hexane) 
Figure 6-52 


number of edges is m = 4(4k + 2k +2) = 8k +1. Since the number 
of independent circuits m — n + 1 is equal to zero, such molecules may 
be regarded as trees. Hence, there can be no multiple bonds. The trees 
formed from k vertices include all possible carbon configurations in which 
the atoms are attached. When k > 4, more than one distinct configura- 
tion of the carbon atoms is possible. In each of the parts of Vig. 6-52 
it will be seen that the requisite number of hydrogens is present if we 
merely add hydrogen bonds until each carbon atom vertex is of degree 4. 
Each distinct configuration is called an isomer. 

The first diagram in each group of Fig. 6-52 represents what is called 
a straight-chain hydrocarbon, for obvious reasons, and the remaining 
diagrams represent branched-chain hydrocarbons. The names of the 
latter are often prefixed with iso-, for isomeric. These substances appear 
naturally as mixtures of all the possible stable isomers. With increasing 
k, the properties of the various isomers become quite different, and hence, 
in order to distinguish among them, it becomes necessary to know how 
many isomers can be present. Cayley" published, in 1875, the first 
paper applying graph theory to chemistry to solve this problem “without 
mistake or repetition.” He represented a molecule by a rooted tree, 
considered all possible configurations, and then decided which forms 
were chemically identical. For example, for k = 5, there are nine rooted 
trees, as in Fig. 6-53; but six of them are seen to be chemically identical 
with others, so that the three figures shown for k = 5 in Fig. 6-52 are 
really the only possible isomers. 

The problem of repetition can be solved by representing all the 
figures as “‘centered” or “bicentered” trees, as in Fig. 6-54. A centered 
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Figure 6-53 
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tree is understood to have two or 

more main branches, all of equal nr V4 \ | 
height, incident with its root; while 
a bicentered tree has two roots with 
one or more main branches incident 
with each root. It is essential that Figure 6-54 

the branches here also be of equal 

height. When this is so, we find that the duplication is removed, and 
three chemically distinct isomers are again present. 

By drawing all possible isomers according to this simple rule, Cayley 
determined the number of structural isomers for the paraffin series up 
to k = 13. His results are shown in Table 6-1. A more sophisticated 
result is that of Schiff,8! who published the second paper in this area, also 
in 1875. His method requires that these single-bonded, or saturated 
hydrocarbons, with k > 5 be written according to a symmetric scheme of 
main chains and shorter side chains as illustrated in Fig. 6-55. When 
trees are arranged in this manner, it is found without difficulty that the 
total number of vertices / may be expressed as a function of the number 
of vertices K on the main vertical stem (using four right triangles taken 
in pairs) as follows: 


Centered trees Bicentered tree 


K\? K2 
k=2 (9) aioe K even 
“< 2 K — 2 
b= (PY +( >) BASEL ogoad 
2 2 
Table 6-1 
kesh Bi 18) Be Ab 6) 7 8 9 10 ll 12 13 
Centered....... LO Md ih 2 8 9 20 37 86 181 422 
Bicentered o Lt 0 1 


3.3 9 15 38 73 174 380 
5 


Total. 9 18 385 75 159 355 802 


beds Se 


pee) k= 32 


Figure 6-55 
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A rather complex method can be evolved for calculating the number 
of isomers for k up to 10; then additional refinements are necessary. 
The method may be extended to double-bonded molecules with C,H 
and triple-bonded molecules with C;,H»,-2 Other research has deter- 
mined the number of structural isomers for much more complicated 
substituted molecules. 


6-238 GRAPH THEORY AND STATISTICAL MECHANICS: 
TWO EXAMPLES 


We have already alluded to the fact that a number of combinatorial 
problems of graph theory have a statistical mechanics origin. Since 
any specific physical example requires substantial background, we shall 
limit our illustration of the transfer of ideas from physics to graph theory 
to the well-known (1) Ising and (2) dimer problems. A ferromagnetic 
substance has been observed to lose its magnetic intensity as a function 
of temperature according to a decreasing curve which exhibits a sharp 
drop at a certain temperature level known as the Curie point. The 
problem is to develop a model which explains this phenomenon of sharp 
drop. The problem can be defined on a lattice, and the hope is to write 
down the partition function which for a finite lattice is an analytic func- 
tion of the temperature and does not account for the sharp drop. How- 
ever, in the limit this function can reveal this property. 

The above problem is defined on a three-dimensional lattice and is 
based on the assumption that ferromagnetism is caused by interactions 
between the spins of certain electrons in the atoms making up a crystal. 
These interactions give rise to the following generalized graph-theoretic 
version of the Ising problem. It is required to find the generating func- 
tion of the number of admissible labeled subgraphs with h edges of a 
labeled graph which is an n-dimensional lattice. A subgraph is admissi- 
ble if all its vertices are of even degree. So far this problem has been 
solved only for n = 1 (Ising) and n = 2 (Onsager). Figure 6-56 is an 
example of an Ising graph. Reference 65 is an excellent expository paper 
on Ising’s problem. Montroll has other papers on lattice statistics with 
a good collection of references. 

Another interesting problem in statistical mechanics is the dimer 
problem (attacked by Montroll with Pfaffians), which arises in the theory 
of adsorption of diatomic molecules (dimers) on surfaces. Here every 
diatomic molecule sticks on the two-dimensional lattice such that each 
of the two atoms of the molecule falls on a lattice point. The problem 
is to find the number of ways nearest neighbors can be connected on a 
doubly periodic lattice (a rectangular lattice each opposite pair of whose 
sides are identified, yielding a lattice on a torus) so that no point is left 
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Figure 6-56 


free of some atom. This problem is equivalent to that requiring the 
number of ways to cover a rectangular board divided into small squares 
with dominos each of which consists of two squares (of the same size as 
those on the board) glued together. Clearly, the board must have an 
even number of squares and must also have an even number at least 
along one side. A more elaborate mathematical version of the lattice 
problem is the paving problem, which requires the number of ways to 
cover a two-dimensional lattice of n squares with n; single squares and nz 
double squares, where n; + 2n2 = n. (See also Sec. 6-13.) 


6-24 A GENETICS PROBLEM 


The following problem due to Benzer? is related to the possible linkage 
of allowable configurations of the chemical constituents of genes. If 
these constituents are molecules in three dimensions, they may be linked 
together according to some pattern. This linkage may be represented 
in three dimensions by assigning a vertex to a molecule and joining the 
vertices by straight lines if the corresponding molecules are linked 
together. The question now is whether it may be possible to arrange 
the molecules on the same straight line so that they still have the same 
linkages. On the line, each molecule is represented by an interval, and 
the fact that it is linked to another is shown by intersecting the intervals. 
Thus the problem is to find conditions for representing the linkage graph 
on a line in terms of appropriately overlapping intervals, i.e., to associate 
intervals with the vertices in such a way that two intervals intersect 
if and only if they correspond to adjacent vertices. A graph which 
can be represented in this manner will be called an interval graph. A 
triangle may be represented on the real line by the overlapping intervals 
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(a) (b) 


To indicate more 
connections as 
those already drawn 


Circuit with n > 3 vertices (n+ 4 points; n> 2) 


(¢) (d) 


Indicating more vertices on this 
line, each connected to both 1 and 2 


(n+ 5 points; n 2 1) 
Figure 6-57 


(0,3), (1,4), and (2,5). On the other hand, a circuit of more than three 
vertices cannot be represented in this manner. 

It has been shown that if a graph does not contain one of the five 
types of subgraphs embedded exactly as shown in Fig. 6-57 (i.e., the 
vertices of the subgraph have only the connections indicated here) then 
it can be represented by intervals on a line. The biological problem in 
its generality is concerned with mutant genes where one is given informa- 
tion as to which pairs intersect (i.e., have linkages) and it is desired to 
test the compatibility of this information with a linear model of the gene. 

Fulkerson and Gross? have shown that the problem of characterizing 
an interval graph is a special case of the following problem which they 
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have solved: Given a (0,1) matrix A (that is, a matrix whose entries are 
0’s and 1’s), when can its rows be permuted so that the 1’s in every 
column of the resulting matrix appear consecutively? Note that one 
can associate a graph with an arbitrary matrix A in various ways. One 
way, for example, is to form the intersect graph of the columns of A. 
(The intersect graph of a family of n sets is obtained by associating a 
vertex with each set and joining a pair of vertices by an edge if their 
corresponding sets have a nonempty intersection.) 

In solving the problem, the concept of an “overlap” (to be distin- 
guished from “‘intersect’’) graph and that of a component graph are intro- 
duced. When applied to interval graphs, A must be taken to be the 
dominant clique-vertex incidence matrix of the graph. (A set of vertices 
every pair of which is joined by an edge is known as a clique of the graph. 
If the family of all such sets of vertices are partially ordered by set inclu- 
sion, the maximal elements are called dominant cliques of the graph. 
Since two vertices are joined by an edge if and only if they belong to some 
dominant clique, the dominant clique vs. vertex matrix characterizes 
the graph.) The result is that a graph is an interval graph if and only 
if the dominant clique-vertex incidence matrix of the graph has the 
consecutive 1’s property. Gilmore and Hoffman*! have proved that a 
graph G is an interval graph if and only if every quadrilateral in G has a 
diagonal and every circuit of odd length in the complementary graph 
has a triangular chord. A triangular chord of a circuit through vertices 
v1, . . . , %is any of the edges (v,ri42), 1 < i < & — 2, (Y-1,01), OF (e,02). 


HUMAN SCIENCES APPLICATIONS 


6-25 GRAPHS AND CYBERNETICS! 


In reading the present section, the reader should compare and contrast 
the concepts presented here with those discussed in Sec. 6-30, He will 
find an overlapping of concepts, but the common ideas appear in two 
quite different settings in these two sections. Also, the transformations 
discussed here are more general than the special transformation in which 
each member of a group is mapped into a new member by adding (assum- 
ing an additive group) a fixed member of the group. 

The concepts of change and of selection are of paramount impor- 
tance in the study of biology and evolution. Let us consider first the 
concept of change, e.g., the change in color of human skin in the presence 
of sunlight. We call the pale skin the operand, the dark skin the trans- 
form, and the sunlight the operator. Let the change itself be called the 
transition. The transition may be represented as 


Pale skin — dark skin 
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A set of transitions, by one operator, over a set of operands will be 
called a transformation. It may be represented schematically as 


a ioc 
a Ore. 
The transformation is closed if the set of transforms contains no elements 


not already present in the set of operands; thus 


(ner 
c@ a 


is closed. A transformation is single-valued if it converts one operand to 
precisely one transform. (The set of transforms may contain duplicate 
elements). Thus 
| abe 
ea. a 


is single-valued. On the other hand, 


we b c 
e or @d e or f g or h 


isnot. Furthermore, the transformation is one to one if the set of trans- 
forms contains no duplicate elements. For example, 


| abe 

ca 0 
is one-to-one but 

| abe 

caa 


is not. 

Let us now consider an example of successive single-valued transfor- 
mations on a set of operands. This example will demonstrate ‘‘selec- 
tion” on a set of operands: 


ABCDEFGHIJKLMNOPQRS TUV WXYZ 
DIFFERENCEBET WEENAMERMAIDA 
FCRREAEWFEI BEAEEWDT EATDCFEFD 
RFAAEDEARECEEDEEAFEEDEFIFRFE 
ARDDEFEDAEFEEF EEDREEFERRAR 


(The second line is a fragment of a sentence selected from Mark Twain’s 
“Letters from Earth”). Note that after only four transformations the 
original set of 26 elements has been reduced to 5. The foregoing suc- 
cessive transformations may be represented by a directed graph which 
Ashby calls the kinematic graph (Fig. 6-58). Note that the vertices 
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Figure 6-58 


incident with the cycles (or loops) will not be eliminated under successive 
transformations. The cycles of a kinematic graph are called “equilibrium 
basins,” since they are invariant or periodic under successive transfor- 
mations. In evolutionary terminology, the elements incident with 
eycles are the “survivors” under this transformation. (The elements 
in the original set of operands might be the elements in a gene pool under 
radiation.) Note that no element will be lost under transformation only 
in the special case of a 1-1 transformation. In graph-theoretic terms 
we have the following theorem: 

Theorem 6-11 The kinematic graph consists of a number of vertex- 
disjoint simple cycles if and only if the transformation is one to one. 

Proof: If the transformation is one to one, then the negative and 
positive degree of each vertex is unity. Thus the ares can be partitioned 
into simple cycles, which are necessarily vertex-disjoint. If the ares 
form a set of vertex-disjoint cycles, the positive and negative degree of 
each vertex is unity, and so the transformation is one to one. 

The identity transformation | a b ewill have a kinematic graph 

eb ¢ 

consisting of k = 3 loops (Fig. 6-59). When the transformation is not 
one to one, the number of times it must be applied to select the minimal 
number of elements equals the number of ares in the longest of the paths 
from a vertex not in a cycle to the first vertex in a cycle. 

The important point to Ashby as a biologist is that under any other 
than the 1-1 transformation, the variety 
in the set of elements will decrease and 
that it can never increase. Some subset op a) 
of the original set will be “selected” by 
the transformation. = b s 

Transformations can be represented Figure 6-59 
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U5 % % 


%) v2 Yy V9 


Figure 6-60 


by means of the vertex matrix, for example, 


a 6 6 

(e i : becomes aj}0 01 
b}1 00 

e110 20 


The problem of which elements will be selected by the mth repetition of a 
transformation is equivalent to asking which columns of the matrix will 
contain positive elements after raising the matrix to the mth power. 

The matrix of a 1-1 transformation is the only matrix of a single- 
valued transformation having unit entry in each row and column; there- 
fore it is the only matrix capable of being primitive. Thus 


abe 

ie ‘ ‘ yields a|0 0 1 
b}]1 00 

¢ 0" 1. o 


The third power of this matrix is a matrix all of whose elements are equal 
to unity. The matrix will be primitive only if the kinematic graph is 
connected; otherwise, there will be & primitive subgraphs. This treat~ 
ment can be extended to multivalued transformations equivalent. to 
Markov chains. 

In addition to the notion of an isomorphism for transformations 
which are consistent with our definition for graphs, Ashby also defines a 
homeomorphism for transformations. The requirements for homeo- 
morphism are less rigid than those for isomorphism. Two sets of 
operands are homeomorphic if a many-to-one transformation applied 
to the more complex can reduce it to a set which is isomorphic with the 
simpler. In graph terms, two graphs would be homeomorphic if an 
appropriate collapsing of subgraphs of the graph with the greater number 
of vertices to single vertices yields a graph which is isomorphic with 
the simpler graph. The two graphs of Fig. 6-60 are homeomorphic. 
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6-26 SOCIAL SCIENCE APPLICATIONS 


A directed graph can be used to represent a social hierarchy or kinship. 
Occasionally, studies in anthropology reveal complicated kinships for 
which the following formulation is applicable. We give a simple illus- 
tration of the ideas. For example,®! we consider a family group consist- 
ing of David (D), his son John (J) and daughter Grace (G), John’s wife 
Sylvia (S) and two sons Michael (M) and Richard (R) and daughter 
Emily (E), and Grace’s son Ben (B). The relationships P (being a son 
of) and Q (being a child of) are illustrated by the directed graphs of 
Fig. 6-61. The associated matrices of these two graphs are given by 


BAH auway 


ecooororoUSGocoorocoY 
ecocorscoMesccoc or oo @ 
eccooccc oo FV oocococooc FY 
i el 
ee 
ccooccooceo FH occocccoc BF 
ecoooccoeo Moco ccooe A 
cocsococooeo KR occ ocooc & 


© 
i] 
SPH ni wou 


B R M B E R M 
Graph of relation P (son of) Graph of relation Q (child of) 
Figure 6-61 
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We can then compute the product of these two matrices to obtain the 
matrix for the relation PQ (being a son of a child of, i.e., a grandson of). 
We have 

DG BSS fh BPs 


0 elsewhere 


PQ 


Bren wayg 
=PrPeooroo 


Here is another illustration of the applicability of graph theory to the 
social sciences. In an experiment in the study of the effectiveness of 
communication patterns, each of five subjects was given five symbols 
out of a set of six symbols*’ shown here: A, +, *, 0, ©, ©. Only one 
symbol was held in common by the five subjects during each trial. 
Records were kept on the number of messages and the amount of time 
necessary for each subject to determine, by communicating with others, 
which was the common symbol. 

The four communication patterns of Fig. 6-62 were used. (Here 
communication is assumed possible in both directions along each edge.) 
These patterns provided increasing degrees of centrality for subject C 
and increasing degrees of isolation for the other four subjects as a group. 
The structure of the graphs permits communication flexibility; i.e., within 
any given pattern there are several ways to transmit information and a 
leader could have developed at any vertex (but C was most often the 
leader). These patterns allowed personal evaluation by the subjects 
on their satisfaction with different vertex positions. It was found that, 
the wheel is the best organized for communicating, i.e., required the 
smallest number of information messages, and the circle the least organ- 
ized, i.e., required the largest number. The time for solution on the 


c C A B B 
B ; 
B D D c é 
D 
A E A E E E 'D 
(1) (2) (3) (4) 


Circle Chain Y Wheel 
Figure 6-62 
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wheel was less than for the other patterns, all three of which required 
nearly the same time. The greatest satisfaction of the whole group 
was with the circle and the least with the wheel. The position occupied 
by centerman C was most favored for all patterns where position made 
a difference. 

In another experiment three subjects A, B, C were assigned the task 
of reconstructing a total list of 25 words where each of them had a partial 
list.*® Their lists contained sequential word couples such that the second 
of a couple on, for example, A’s list was the first word of a couple on B’s 
or C’s list. This idea is illustrated in the accompanying table. 


Total A B Cc 
1. South south | south 
2. Through through | through 
3. Fine fine fine 
4. Gate gate | gate 
5. Do do do 
6. West west: west 
7. Have have | have 
8. Rain rain rain 
ass ete. ete. 


Another task the subjects were assigned was to reconstruct a sentence 
25 words long (where no word has more than three syllables) when each 
subject had a partial list of the words as they appear in sequence in the 
sentence. One such sentence was: “The picture we saw was painted 
by an old woman who had been taught how to mix the colors by one 
of the native artists.’’ One partial list might have been: 


picture was an who been to the colors of native 


The third task assigned was an anagram of three-letter words made 
from words of nine letters each with four vowels, e.g., abolished, courte- 
san, Subjects had to maximize the group score, and the same word on 
each paper counted three times. The five communication patterns of 
Fig. 6-63 were used. Note that in this case communication was limited to 
specified directions. For the different patterns the length of time, the 
number of messages, and the difficulty in correcting errors in transmission 
were measured. The subjects were in different rooms, connected only 
by earphones and microphones. The wires were supplied with white 
noise, i.e., all frequencies at the same intensity, to dampen sounds that 
went through the walls. In addition, each task was tested under three 
different signal-to-noise ratios, +6, —2, and —10 decibels, which gave 
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B B B B B 
Ba ya on Pee 
Figure 6-63 


correct reception 85, 66, and 24 per cent of the time, respectively. It 
was found that the first pattern was best for the first task, the third 
pattern was best for the second task, and all nets were the same for the 
third task. The fifth pattern was the worst for the first and the second 
tasks, and the different noise levels only accentuated the disadvantage 
of this pattern. 


6-27 A MATHEMATICAL MODEL OF DISARMAMENT® 


One may define the set = consisting of a finite number of states, where 
each state represents a level of arms available to two competitive oppo- 
nents X and Y under conditions of stability. Stability, balance, or equi- 
librium is an essential criterion in this formulation. It requires that 
neither opponent judges his position, i.e., state (defined below), to be 
weaker than his opponent’s. 

The elements EZ; (j = 1, . . . , p) of = are vectors 


Ej = (aaj, . « « ,Qnjsbrj, « » « bri) 


where a;; denotes the number of units in a weapons system (e.g., a gun, an 
amount of information, an economic factor, ete.) of type k available 
to X at step 7 of the disarmament process and by; is a similar quantity 
available to Y. Each of X and Y will select a set of rules to be applied 
to an initial state of arms to produce a new state. The same rules or 
different rules may be applied to the new state to obtain a third state, 
ete. X’s total scheme of arms reduction will produce a set of states 
any of which need not be acceptable to Y. The object is to find those 
states on which there can be agreement and then establish rules of reduc- 
ing arms on these states. It is assumed that the initial state to which 
the rules are applied will be considered as an equilibrium state by both 
sides not necessarily for military reasons only but also for political, eco- 
nomic, or other reasons. Our subsequent discussion will show the 
dependence of the entire process on compensating factors used by both 
sides. Our purpose now is to show how one obtains >. 

An equilibrium or stable state is a state that is an admissible state 
of both sides. A natural criterion for selecting admissible states is for 
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X to take a; = axjb.j, where a; is known as the compensating factor. 
In fact, comparisons must be made with all of Y’s weapons. 

It is clear that a common denominator for units of arms reduction 
is essential to this formulation. Thus if there is a numerical superiority 
of one weapon type, this may be compensated for by superiority in the 
reverse order by another weapon. The deficiency (or advantage) ayj 
must be measured in the basic unit of both weapons. Indeed, the com- 
pensation may be on the basis of several other weapon types, rather than 
one other weapon type, and hence a unit of common measurement is 
required. 

A single judgment factor may not be adequate to determine whether 
a state is admissible; hence, we define a state H; to be in the set 2. of states 
admissible to X if ||«;||—called the norm of 


a; = (aj, « - « ,Ons) 


the vector of compensating factors—is not less than a specified value « 
selected by X. The norm ||q;|| is a measure of all a; (kK =1,...,7). 
Since the importance of weapons will differ from one type to another, a 
useful norm to take for a; is 


llesll = > Wren; 
k=1 

where w, is a weight of the importance of weapon type k averaged over 
different conflict situations. Similarly, one may introduce {;; and |{8;|| 
for the definition of 2,, the set of Y’s admissible states. Note, for exam- 
ple, that the state (0, .. . ,0;1, . . . ,1) is admissible to Y but not to 
X, and hence it isin 3, but notin Z,. Similarly, (1, ... ,1;0,... ,0) 
isin 2, but notin Z,. It is easy to assume that such states are admissi- 
ble, since one side will have zero arms. Finally, the set of admissible 
equilibrium states (of both X and Y) is = = =, 3,; that is, it is the 
common part of both. 

One problem of arms control is to prescribe rules for reducing arms. 
(Note that we are using reduction in the broad sense, since certain indi- 
vidual weapon types may actually require increase.) Whatever the 
rule, its purpose is to effect a transition from one state in = to another 
state in =. The rules developed by each competitor need not coincide 
with those of the other, because, for example, 2, will naturally contain 
states not in 2,, and conversely. Thus the problem is to find those rules 
which provide a sequence of transitions between states of © without 
ever going into a state not in. In practice, it is usually the rules that 
define the states. 

Assume now that we have a listing of all the elements of the set 2. 
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(Clearly, this is difficult in practice, because neither side would be willing 
to acknowledge his compensating factors. However, one of the advan- 
tages of this approach is to show how, by offering the other side various 
rules which he accepts or rejects, it is possible to guess at approximate 
magnitudes of his compensating factors.) These elements are obviously 
finite, although the escalation of weapons increases the size of 2 in time. 
For simplicity, let us assume that the states of 2 are H,,... , H,. 

If the problem of finding transition rules is solved, the next problem 
is to find a method of applying these rules to obtain all those states which 
fall on a path of arms reduction from a given initial state, for example 
Fy, to any intermediate state 2,, q¢ <r. If there is no such path, the 
rules are inadequate and must be altered to make the steps possible. 
It is clear that the transition from the initial to a designated intermediate 
state can be effected in a single step, but because of the many possibly 
adverse effects of large reduction steps in arms it is essential to carry out 
the reduction in tolerably small steps. In addition, a single step need 
not be acceptable to both sides or feasible as far as safety is concerned, 
because it takes time to effect arms reduction and monitor its execution. 
A useful outcome of this approach is the fact that it can be used to deter- 
mine whether it is impossible to reach a prescribed state from an initial 
state according to a given set of rules. In other words, not every method 
which one may advance, even a seemingly good one, is guaranteed to lead 
to a stable predetermined state, assuming that this method is to be used 
over and over again. 

The first problem of choosing the rules of transition is nonmathemati- 
eal. It depends on several political, military, and economic factors. 
Its solution requires detailed information and judgment from these areas. 
However, the problem of applying the rules to determine the possible 
intermediate steps of disarmament can be studied mathematically even 
if the rules are changed between the steps; for then all that need be done 
is again to apply the method using the state at which the rules are changed 
as an initial state. One can now associate vertices with the states and 
apply the approaches to transition problems discussed earlier. 


6-28 GRAPHS AND LINGUISTICS} 


A language is composed of a finite set of recognizable characters compris- 
ing an alphabet, together with a finite set of rules for combining the 
characters. The set of rules is called a grammar, I’, and the sequences 
of characters which can be formed according to T are called the strings, 
2, of the language. In particular, the characters of the alphabet may be 


+ For references, see end of the section. 
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man hit the ball 
Adjective Noun Adjective Noun 
Verb ner 
Noun Noun 
phrase phrase 
Verb 
phrase 
Sentence 
Figure 6-64 


considered as special one-character strings. A language A is then com- 
pletely characterized as A = (2,T). 

A typical problem of computational linguistics is the recognition ofa 
fixed string B) as belonging to a language. The recognition process 
involves diagramming the string, i.e., finding grammatical types such 
as nouns, verbs, and phrases from the string (Fig. 6-64). In general, 
natural languages cannot be completely characterized by grammars 
alone (Chomsky, 1962). One of the foremost problems is finding: 


1. Subsets which can be characterized by grammar. 


2. An adequate (nongrammatical 


Two problems arise in analyzing 


1. Admission of nonsense string: 


furiously. 
2. Admission of grammatical am| 


) model for natural language. 
natural-language strings: 


s such as: The green ideas sleep 


iguity: Pretty little girls’ camp. 


This second example can be diagrammed as several distinct noun phrases, 
while noun phrases can always be used equivalently. 


Definition of a Grammar! 


A category is a set of strings which are denoted by the same name. 
There are three kinds of categories: 

1. Arbitrary sets of one character strings 

2. Products of categories; c = AB = C contains all strings ¢ = ab, 


where ae A, be B 
3. Unions of categories;c = AU Boece A orceB 
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A grammar is a list of categories and their derivation. No ordering is 
implied, except that category derivations which are unions must appear 
lower in the list than the categories which form the union. This restric- 
tion avoids circularity of the form 


A=BUC B=CUA C=AUB 


Example of grammar 


TV 

2. U 

3. B 

4, P= BF 
5. Q@=PUU 
6. R = QF 
7.F=VUR 


Let a, 8, and y be types of vertices corresponding to the three kinds of 
categories. The grammar above can be represented by a graph‘ (Fig. 
6-65). Note that the graph has names for only a-type vertices and the 
vertex which is used recursively. This suggests the following distribu- 
tion and substitution: 


F=VUR 
=VUQF 
=VU(PUU)F 
=VU(BFU U)F 

F=VUBFFUUF 


This may be interpreted as a grammar for a simple prefix language 
where F' means formula, V means variable, B means binary operator, and 


Figure 6-65 Figure 6-66 
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U means unary operator. The recursive definition for formula is: 


1. A variable 
2. A binary operator followed by two formulas 
3. A unary operator followed by one formula 


The corresponding graph is given in Fig. 6-66. The graph of a gram- 
mar is called the T graph of a language and is used to (1) generate all 
strings or (2) recognize arbitrary given strings. The strings of the prefix 
language are: 


ir 

BVV 

UV 
BBVVV 
BVBVV 
BBVVBVV 


DOE a Cris eet 


The following points may be made: 


1. The I graph of a language is finite. 

2. The set of all strings = is infinite. 

3. The set of all strings = is enumerable. 

4. An arbitrary string 2, can be represented by a tree. To see this, 
consider 35, the sixth string above, Fig. 6-67. 


Note that = graphs are directed, rooted trees, each vertex of which is 
named by the corresponding category name. It is usually required that 
the > graphs of formal languages be connected and single-headed, i.e., 
trees and not forests in the terminology of graph theory. It is also usual 
to expect that the strings of a language are unambiguous; that is, each 
string has a unique = graph. While it is easy to tell if a given string is 
ambiguous with respect to a given 
grammar, it is difficult to determine 


a B Vv if B Vv Vv 
whether or not a grammar is ambi- 
guous or a language (which may have t t Re 
many different grammars) is inher- 
ently ambiguous. In fact it has r FP 
been shown? that whether or not a 
formal grammar (of the type de- ‘ 


scribed here) is ambiguous is unsolv- 
able. On the other hand, if the Figure 6-67 
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strings of a language are enumerable, they may be checked for ambi- 
guity for sufficiently large numbers.4 

The efficient recognition of strings by machine has been a central 
problem of computational linguistics. Many different devices have been 
used, including no-back-up indicators and precedence functions for each 
category. (For example, do not back up past a sentence structure, do not 
look for a verb phrase unless a noun phrase has been found.) Some 
investigation has been made into special restrictive grammars.* 

It might be more fruitful to find special restrictions or extensions on 
the I graph of a language. One extension which appears interesting is 
the assignment of length functions to the arcs of the graph. Initially, 
all lengths are equal, but as each arc is successfully used, its length func- 
tion is increased. Alternative paths are always chosen on the basis of 
length, so that the machine, based on ‘“‘experience,” improves the effi- 
ciency of its operation. If such a weighting scheme were built into a 
FORTRAN compiler, the compiler would (in some sense) learn the style 
and programming idiosyncracies of the individual programmer. 

It should be clear that computational linguistics is not concerned 
with languages as much as it is concerned with structure and pattern 
recognition in linear strings. Thus far, the focus has been on artificial 
languages and special subsets of natural language, and the objective has 
been mechanical analysis. 

Recently, a project which will attempt to characterize large stores of 
tabular data structurally has been initiated. In this instance, the alpha- 
bet consists of table entries—sets of related properties—which can be 
recognized. Categories of information are defined structurally from 
entries. It is hoped that this will prove to be an effective technique for 
dealing with nonhomogeneous data which are stored sequentially (as on 
a magnetic tape). 
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ADDITIONAL APPLICATION 


6-29 MATHEMATICAL MACHINES AND MARKOYV CHAINS 


Many physical systems can be characterized by itemizing the various 
“states” which they may assume and describing the way in which they 
will respond when in any given state and when subject to each of a num- 
ber of possible “input stimuli.” In general, the response consists of a 
change of state together with the production of appropriate ‘‘output.” 
The concept of a mathematical machine results when one formalizes 
this idea, 

Recent work in the theory of mathematical machines (sometimes 
referred to as the theory of automata) seems to bear little resemblance 
to the deep researches of Gédel, Turing, and other mathematical logi- 
cians. The new theories apparently consist largely of elementary the- 
orems, but suggest, however, difficult finite combinatorial problems, pos- 
sibly amenable to graph-theoretic techniques. 

Definition A machine is a mathematical system consisting of: 


= 


. A finite set S = {s1, . . . ,8m} of elements called states 
2. A finite set X = {a, . . . ,2,} of elements called inputs 
3. A finite set Y = {y, .. . jy} of elements called outputs 
4. A transition function T which maps S X X into S 

5. An output function Q which maps S X X into Y 


If se S and xe X, then s’ = T(s,x) is interpreted as the next state 
which the machine will be in if its present state is s and if it is presented 
with input x. Similarly, y = Q(s,x) is the output produced when the 
machine is in state s and is presented with input «. The sets X and Y 
are referred to as the input and output alphabets (although the nature 
of their elements vary greatly from one application to the next). 

The above definition of “machine” should be qualified in several 
respects. Such machines are deterministic, since the output and next 
state are completely determined by the input and current state. They 
are sequential, since we are assuming that inputs are presented, and 
reacted to, at discrete points in time ty, fz, ts, . . . rather than continu- 
ously. They are complete, in the sense that every combination of state 
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and input is considered meaningful and produces a known output and 
next state. They are memoryless in the sense that the current output 
and next state do not depend on past inputs, states, or outputs. Finally, 
they are stationary in the sense that the transition function 7 and output 
function 2 do not depend on the point in time ¢; under consideration. 
One can formulate more general machines by altering some or all of these 
assumptions. 

For some purposes, it is convenient to represent a machine by a 
directed graph whose vertices are the states and whose ares—with appro- 
priate adornments—characterize X, Y, 7’, and Q. This is most easily 
explained in terms of an example. Consider a machine for which 


S = {81,82,83,8s} X= {01} and = {a,b,c} 


The graph in Fig. 6-68 describes one possible machine associated with 
state set S and alphabets XY and Y. Each vertex corresponds to one 
state and is the initial vertex of precisely two arcs (in general, k arcs, where 
k is the number of distinct inputs). If an are having initial vertex s 
has terminal vertex s’ and associated ordered pair (x,y), the interpreta- 
tion is that 

T(s;2) = 8 Xs,z) = y 


For example, if the system is currently in state s3, then input 0 will yield 
output b and transform the system to state sy On the other hand, 
input 1 will produce output a and the system will remain in state s3. 
(In practice, one may reduce the number of ares by labeling some ares 
with several ordered pairs if several inputs will yield the same next state.) 

One can classify individual states, and sets of states, in terms of 
structural features of the associated graph. For example, a strongly 


(1a) 


(1b) () (0) 


(0,b) 


(1,a) 


Figure 6-68 
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connected machine is one whose graph is strongly connected. For such a 
machine, one can start with any state s and transform the machine into 
any other state s’ (but not necessarily in one “‘step”) through an appro- 
priate sequence of inputs. <A transient state s is one which is the initial 
vertex of at least one are (s,t), where ¢ # s, but is not the terminal vertex 
of any are (u,s), where u # s. For example, s; is a transient state. A 
persistent state s is one which is the terminal vertex of at least one are 
(t,s), where ¢ # s, but is not the initial vertex of any are (s,u), where 
us. (The machine under consideration has no persistent individual 
states. However, states s», ss, and s, jointly constitute a persistent set 
of states in an appropriate sense.) 


Given any initial state s and any tape, or finite sequence a, . . . , a 
of inputs, one can use the graph to determine the resulting final state s’ 
(after ¢ transitions) and the corresponding output sequence y:, . . . , ye 


readily. For example, if the machine is currently in state s; and if the 
next five inputs are 1, 1, 0, 1, 0, then the machine will pass successively 
into states 83, 83, 84, 82, 8; and yield outputs ¢, a, b, a, a. 

Among the classes of problems which arise in machine theory are 
these: 


1. Analysis of the responses (transitions and outputs) of a given 
machine. 

2. Synthesis or construction of machines having specified response 
characteristics. 

3. Reduction of machines to “simpler” machines which are in some 
sense equivalent. 


The reader is referred to Gill*® and Ginsburg*® for extensive discus- 
sions of these and other topics in the theory of mathematical machines. 
It has been our objective here merely to indicate that one convenient way 
to present a machine is by means of a directed graph, with associated 
symbols, and that this point of view is useful in connection with the 
classification of machines and with some kinds of analyses of machines. 

The notion of a Markov chain is, in some respects, a probabilistic 
counterpart of the (deterministic) mathematical machines. Here again 
we have a system which may be in one of a finite number of states and 
which is subject to state transitions at discrete points in time. Now, 
however, the transitions do not depend on any (controllable) inputs, 
but are governed by probability distributions. No outputs are associated 
with the process. One is frequently interested in the probability dis- 
tribution of states, as a function of time, given a particular initial state. 

Expressed formally, a Markov chain is a mathematical system con- 
sisting of 
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1. A finite set S = {s:, . . . ,8n} of elements called states. 

2. Ann X n transition matrix P = {p,;}, where p;; is the probability 
that at the next observed point in time the system will be in state s;, 
given that it is currently in state s; It is required, of course, that 


Spi =1 fori=1,2,...,% 
a 


A Markov chain, as defined here, is sometimes called a stationary 
(or time-invariant) Markov chain to suggest the fact that there is a more 
general notion in which the transition probabilities may be a function 
of time. 

The graph of a Markov chain is the directed graph having the states 
as vertices and having an arc from s; to s;, labeled p;, whenever pi > 0 
(that is, whenever a “one-step” transition from g; to s; is possible). Fig- 
ure 6-69, for example, depicts a five-state Markov chain. Given that 
the system is currently in state s:, the probabilities are 0.2, 0.3, and 
0.5, respectively, that the system will pass to state 3, pass to state 85, 
or remain in state se. The other ares have similar interpretations. 

A qualitative classification of states, and sets of states, can be made, 
based on structural characteristics of the graph and ignoring the specific 
values of the probabilities (except that nonzero probabilities have been 
distinguished from zero probabilities). For example, a set of states 
T CS is a trapping (or absorbing) set if the directed cut-set {7, S — T} 
is empty (i.e., there are no possible transitions from a state in 7’ to a state 
not in 7). [In particular, an individual state s; constitutes a trapping 
(or absorbing) state if and only if pw = 1.] A Markov chain is said to 
be ergodic if the associated graph is strongly connected. Thus an ergodic 
chain is one such that, given any current state s,, there is a nonzero prob- 


2 


0.4 


0.6 


Figure 6-69 


A variety of interesting applications 223 


<< >e os 


(a) (b) 
Figure 6-70 

ability of reaching any other state s; after an appropriate number of 
transitions. An ergodic chain is further said to be regular if there is a 
positive integer fo such that for any states s; and s; (with possibly 7 = j) 
there is an are progression from »y; to v; having precisely ¢ arcs for every 
t >t. In Fig. 6-70, for example, the chain shown in (a) is ergodic but 
not regular. (Starting with state s;, for example, one can be in state s3 
only after an even number of stages.) In contrast, the chain shown in 
(6) is regular. 

Exercise 6-26 Referring to the definition of a regular Markov chain, what is the 
smallest value of to which will suffice for the chain shown in Fig. 6-70b? 


Exercise 6-27 Give a graph-theoretic proof of the fact that if the graph of an 
ergodic chain has at least one loop, then the chain is necessarily regular. 


The reader who has an interest in Markov processes is encouraged 
to formulate, in graph-theoretic terms, additional concepts associated 
with the classification of states (as discussed, for example, in Refs. 21, 55, 
and 73). 


6-30 GROUPS AND SIMPLE GRAPHS 


Every simple graph has at least one self-isomorphism, namely, the trivial 
isomorphism in which each vertex and edge corresponds to itself. How- 
ever, there may be additional ways to set up an isomorphism of the 
graph with itself so that one has other than the identity isomorphism. 
An isomorphism of a graph with itself is known as an automorphism. 
The totality of automorphisms of a graph form a group which is known 
as the group of the graph. This group can always be considered as a 
group of permutations of the vertices of the graph. The automorphisms 
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of a polygon with 2n sides (an n-gon) give rise to a 
group known as the dihedral group of order n, whereas 
the group of automorphisms of the complete graph 
on n vertices is the symmetric group of order n. 
The order of the group is called the symmetry num- 
ber of the graph. It is as yet not known whether in 
general every permutation group is a graph group. 
This problem is contained in that of finding the num- 
ber of nonisomorphic graphs with a given permutation 
group. 

Figure 6-71 Exercise 6-28 Find the automorphisms of the graph in 
Fig. 6-71. 


Consider the operation of addition of integers modulo a given integer, 
and let us indicate by means of a graph the relations among the members 
of the residues when a certain member is added to all of them, For 
example, take the integers mod 6. This leaves the residues 0,1,2,3,4, and 
5. Let us add 4 to all of them and indicate the subsequent relations. We 
start with zero. This produces two cycles, Fig. 6-72. 

On the other hand, multiplication by 4 yields the graph of Fig. 6-73. 
Because they map into themselves, 0,2, and 4 are called fixed points. 
One can similarly obtain a graph for a function f(x) = ax + b mod 6, 
for example, where a and b are members of the residue class and x takes 
on values in that class. 


Exercise 6-29 Draw the graphs corresponding to the squaring and the cubing of 
each of the residues mod 6. 


Remark: A fraction $ mod 7 is resolved as follows: We first determine 
the member x of the residue class which when multiplied by 3 yields 
1 mod 7; that is, } = x mod 7, and we find that z = 5. We multiply 
4 X 5 and obtain 6 mod 7; hence 4 = 6 mod 7. 


Exercise 6-30 Obtain the graph of the residue class mod 7 mapped according 
to f(z) = (22 + 3)/(x + 2). 


The foregoing ideas give rise to graphs that are known as Cayley 
color graphs or Venn diagrams. F ‘ 
We start with a finite group and 
select a subset of the group (e.g., a i es” 4 ea 5 
set of elements which generates the fe 
group). We associate a vertex 
with each element of the group 2 
and an are terminating at the ver- Figure 6-72 
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tex to which the element is transformed on being transformed (e.g., by 
multiplication or addition) by a member of the subset. 

Thus each vertex is the initial vertex of as many arcs as there are 
members of the operating subset. Lach are is colored with a specific 
color associated with each element of the subset. Note that a loop is 
associated with a vertex when the corresponding element is operated on 
by the identity. This yields a Cayley color graph. It is known that 
such a graph is connected if and only if this generates the group, i.e., 
there is a path between any two vertices, since, by successive multiplica- 
tion by the subset elements, one can operate on any element to get any 
other element. Thus 4 is not a generator of the group defined by the 
residues mod 6 under addition, since the graph is not connected. 


Exercise 6-31 Show that 5 isa generator of this group by showing that the corre- 
sponding graph is connected. 


6-31 CONSTRUCTION OF TREES OF MINIMUM TOTAL LENGTH 


It is easy to visualize problems in which it is desired to build roads 
between several centers where there is one and only one path connecting 
any two centers. Of all such possible road systems between the centers, 
we seek one with minimum total length. This is the problem of finding 
a spanning tree of a graph with minimum total length. Note that a 
necessary condition for a tree to have minimum total length is that the 
length of every chord be greater than or equal to the maximum of the 
lengths of branches in the fundamental circuit which it determines. 
Otherwise, a single replacement using this chord could be made. It 
turns out that this is also a sufficient condition, but the proof is not 
immediate. 

To choose a tree of minimum total length, we first index the edges 
according to increasing lengths so that the length of ¢; is less than or 
equal to the length of e; whenever i <j. We then start by selecting ¢; 
and add és if e, does not form a circuit with e;. In general, we continue 
to consider edges of successively higher indices, selecting an edge when- 
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ever it does not form a circuit with the set previously selected and reject- 
ing it otherwise. This process always yields a tree of minimum total 
length. (See Ref. 56 for proof.) 


6-32. GRAPHS AND EIGENVALUES OF NONNEGATIVE MATRICES 


Directed graphs have been used by Dulmage and Mendelsohn” to study 
various properties of the characteristic equation and eigenvalues of 
matrices which occur in stochastic processes, economics, and numerical 
analysis. A directed graph is cyclically k-partite (k > 2) if and only if 
the set of vertices V can be partitioned into k sets Vi, . . . , V, such that 
all ares satisfy the following property: If an arc has its initial vertex in V>, 
then its terminal vertex is in V4, if p< k and V, ifp=k. We can 
associate with any n X n matrix A = {aj} a directed graph, the coeffi- 
cients of whose adjacency matrix are unity if the corresponding a, + 0. 
If this graph is cyclically k-partite, then corresponding to a partitioning 


Vi, ...., Ve there is a permutation matrix P such that 

0, Bi 

0. Bs 
0; Bs 
PAP = 
Be 

B, 0. 
where 0;, . . . , Og are zero matrices and every element of P-'AP not 
in By, ... , Beis zero. The matrices B,, ... , B, are known as the 


cyclic components of A. 

Thus, a cyclically k-partite graph has the structure exemplified 
in Fig. 6-74. Note that every are progression (and in particular every 
path and cycle) having nm arcs has the property that if its initial and 
terminal vertex are in V; and Vj, respectively, then 7 = i + n (mod k). 
In particular, the length of every eycle is a multiple of k. 


A Va Vs Vea VE A 


Figure 6-74 
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Then X n matrix A has a set of k diagonal components Aj, As, .. . , 
A, if there exists a permutation matrix P such that P-'AP = diagonal 


(Ai, . . . ,Ax). Such a P exists, for example, if the matrix A is irreduci- 
ble (this is identical with the requirement of strong connectedness of the 
graph of A). 


A monic polynomial is one whose highest-power term has a coeffi- 
cient of unity. 

We have the following theorem: 

Theorem 6-12 If A is a matrix such that its directed graph is 
eyclically k-partite and if A,, As, . . . , Ax constitute a set of diagonal 
components of A*, then there exist a monic polynomial f(A) and a non- 
negative integer p such that f(0) ~ 0, the characteristic polynomial of 
A is f(\*)\”, the characteristic polynomial of A* is [f(\)]*\”, there exist 
nonnegative integers p;, . . . , p» with sum p such that the characteristic 
polynomial of A; is f(A)", i= 1, ... , &, and for any nonzero root 
of f(\) the elementary divisors are the same for each Aj,i = 1, ... , kh. 


Exercise 6-32 By using the illustration of Dulmage and Mendelsohn with 


show that the associated directed graph is cyclically 2-partite with partitioning 
Vi = {nivst Ve = {vane} 


and with corresponding cyclic components 


10 20 
B= (5 3) Bs = (5 4 


Show that the graph is also cyclically 2-partite with the partitionmg 


Vi = {vive} Ve = {vs,va} 


id Ci) ed CH 


Verify that the diagonal components of A relative to both partitionings are the same 
with 
2 0 2 0 
Aes a= (5 12) 


Finally, show that the desired polynomial is 


and that in this case 


f(A) = 8 — 14d + 24 


and that all the p’s for both cases are zero, 
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6-33 A RANKING PROBLEM 


Consider the problem of ranking the elements of a set 
S = {81,82 . . . ,8n} 


according to a linear ordering which is known to exist, but whose detailed 
structure must be learned by appropriate comparisons of pairs of ele- 
ments of S. Suppose we wish to rank a set of physical objects in order 
of increasing weight under the assumption that no two objects have 
identical weights. Or suppose that S is a set of individuals who are to 
be ranked with respect to their capability to defeat one another in some 
given form of competition. In this context it is necessary to assume 
that (1) any given individual invariably defeats, or is invariably defeated 
by, any given second individual and that (2) the relation “can defeat’ 
is transitive. 

In the first example the means for discovering the linear order of the 
elements is a succession of weighings with a balance scale. In the sec- 
ond example the means is a sequence of individual competitions between 
appropriate pairs of individuals. If ¢ consecutive competitions involve 
2t distinct individuals, these competitions may be regarded as being 
concurrent. However, we are interested here in the total number of 
individual competitions, not in the number of levels in a hierarchy of 
partially concurrent competitions. 

Returning to the general problem, we wish to find a procedure which, 
in the worst case, requires the fewest individual comparisons in order to 
completely rank the elements. Let Sp(n) be the maximum number of 
comparisons required to rank n elements, using procedure P, We seek 
a procedure which minimizes Sp(n). While we shall not formally define 
“procedure,” we roughly mean a rule for making the next comparison 
(or for terminating the process) which is completely determined by the 
outcomes of prior comparisons and which is implementable for any 
possible set of prior outcomes. 

The following procedure was given by Steinhaus.** Initially, pick 
and compare any two elements. In general, having completely ranked 
some subset of k elements, pick any (k+1)st element s,,: and compare 
it with the middle element of the k already ranked (either of the middle 
two if k is even). Depending on the outcome, next compare sp41 with 
the middle element in the set of elements which rank above or below the 
element on which the first comparison was based. By successive “halv- 
ings” of this type, determine the precise ranking of s,;; in the set of k + 1 
elements. With this ranking of k + 1 elements as a new departure point, 
insert some next element s;+2 in the ranking, ete. 
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As an example, suppose that we have already ranked s, to s;5 in the 
order 
81> 82>8>°*** > 84> 815 


and that the true rank of sis is between gio and 8. The sequence of 
comparisons is indicated by the ares in Fig. 6-75, in which the elements 
are represented by their subscripts. Thus s:5 is compared successively 
with 8s, $12, Si0, and 811. 

In general, one can show that at most 


S(k) = 1 + [loge k] 


comparisons are required in order to add a (k + 1)st element to a com- 
pletely ranked set of k elements, where [log: k] denotes the integral part, 
of log. k. If the above procedure is applied to rank n elements, by insert- 
ing them one at a time, it can be shown that at most 


M(n) = 1+ nS(n) — 25™ 
and at least 
L(n) = 1 + [logs (n!)] 


comparisons are required. 

Ford and Johnson®* have proposed the following more efficient rank- 
ing method, which employs the Steinhaus method. Assuming that 
there are 2r or 2r + 1 elements to be ranked, first form r disjoint pairs and 
compare them. Next, use the Steinhaus method to rank the r winners 
of these comparisons (i.e., the higher-ranked members of the r pairs). 
Figure 6-76, taken from Ref. 23, depicts the results of these rankings in 
the case of a 19-element set. The nine winners of the first-round com- 
parisons (in decreasing order of rank) are J, J, H,...,C, B. The 
nine vertices below these are the losers in the first-round comparisons, 
each being shown below the element with which it was compared. (The 
leftmost element was not considered in the first round.) Since the 10 ele- 
ments A, B, C,... , J have already been ranked, it remains to insert 
the remaining 9. The Steinhaus procedure is used for this purpose. 
The order in which the elements are inserted is indicated by the non- 
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parenthetical numbers in Fig. 6-76. This order is chosen in such a way 
that the maximum number of comparisons required is as small as possi- 
ble. The numbers in parentheses indicate the maximum number of 
comparisons required to insert the corresponding elements in their turn. 

It is shown in Ref. 23 that this method requires (for n elements) 
at most U(n) comparisons, where 


Ul) =0 U(2)=1 
k 
U(2k) =k + U(k) + ¥ TH 
2 


U@k +1) =k+ UK) + ¥ TH 
2 


and where 
TQ) =2 forl<i<3 
Ti) =3 for8<i<5 
oe ee 


For 1 <n < 11 and for n = 20 or 21, it is known that U(n) = L(n). 
It is not known whether or not this procedure is in general optimal. 


TQ) =j fo 


Exercise 6-33 Assume that the true rankings of A to O are 


A BC DeBsk ir FT Keer so 
6 7 2 9 43 32 6°14 2 4 6 30 eee 


Starting with a comparison of A and B, apply the Steinhaus method, adding C, D, 
... ,N,Ointhat order. (When there are two middle elements use the leftmost one.) 
Count the total number of comparisons required, and compare with M(15) and L (15). 


The ranking problem can be viewed from the standpoint of graph 
theory as follows. Let the elements of the set to be ranked be regarded 
as the vertices of a directed graph. Initially, the graph has no arcs. 
As each comparison is made, an are is added from the higher- to the lower- 
ranked member of the pair compared. (Owing to the transitivity of 
the order relation, additional ares are implied by those directly resulting 
from comparisons, but these need not be drawn.) The objective is to 
add enough ares so that one discovers a path which includes all of the 
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Figure 6-76 
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vertices (i.e., a hamiltonian path). The order in which the vertices are 
encountered along this path then determines their proper rank, Return- 
ing to the example of Fig. 6-75, after s; to s;; have been ranked, as indi- 
eated by the horizontal ares in Fig. 6-77, the rank of s;5 is determined 
by adding ares a, b, c, and d (in that order). Note that the vertex 
sequence 


1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 16, 11, 12, 13, 14, 15 


determines a hamiltonian path. A good ranking procedure is one which 
tends to determine such a path after adding the fewest possible arcs. 
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SEVEN 


NETWORK FLOWS 


7-1 INTRODUCTION 


If we assign to each are of a directed graph a rate of flow of a 
hypothetical substance, then the structure of the graph, together 
with the values of the are flows, becomes a useful model for cer- 
tain problems in transportation, communications, and other 
areas concerned with the actual or conceptual movement of com- 
modities, information, or people. In this chapter the concept of 
flow through a network (as a graph is called in this context) is 
formalized and the structure of flows is studied. The develop- 
ment centers around two general problems. The first is the 
problem of maximizing the total rate of flow between specified 
vertices when the flow in each are is constrained to lie between 
given bounds, e.g., maximization of traffic flow when each road 
segment has a limiting “capacity.” The second involves finding 
constrained flows whose costs are minimal when a cost per unit 
of flow is associated with each are. This subject owes L. R. 
Ford, Jr., D. R. Fulkerson, and G. B. Dantzig many of its basic 
concepts and results, e.g., the essential Max-Flow Min-Cut 
Theorem. See Ref. 10 of Chap. 1 for a comprehensive treatment. 
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7-2 BASIC TERMINOLOGY 


Tt is sufficient when studying flows to consider directed graphs which 
are connected and have no loops. These are called networks in this 
chapter. (Flows in disconnected graphs can be analyzed by considering 
each component separately, and flow in a loop would contribute nothing 
to the motion of flow between vertices. So the class of graphs which are 
networks is sufficiently inclusive for our purposes.) To emphasize the 
fact that a particular directed graph is a network, we shall represent it 
by N rather than the usual D. Given a network N = (V,A), a flowin N 
is an integer-valued function ¢ defined on A. The integer ¢(a) is called 
the flow in arc a. More specifically, if a ~ (v,w), the flow is said to be 
from v to w if g(a) > 0 and from w tov if g(a) < 0. 

Most of the concepts and results that will be presented here are 
particularly transparent if one considers a geometric realization of the 
network and interprets |¢(a)| as the constant rate of flow of a homoge- 
neous substance through a. The direction of flow is determined by the 
sign of g(a), using the above convention. 

The vertices of N are customarily classified, relative to ¢, on the 
basis of whether they create, conserve, or absorb flow. To formalize 
this, let »—> V denote the set of arcs having v as initial vertex and let 
V—p denote the set of arcs having v as terminal vertex. Then the 
integer Q(v,¢), defined by the relationship 


Qw,2) = Y e@ — ¥ va) 


vv Vor 


is called the net output at v, relative to ¢. (We shall write Q(v) if ¢ is clear 
from the context.) If ¢(a) > 0 in every are, the first summation is 
simply the total flow directed away from vertex v and the second is the 
total flow toward v. If some are flows are negative, the individual 
summations do not have these interpretations, but their difference 
still represents the net flow output. For example, in Fig. 7-1, we have 
v—>V = {as,02}, V0 = {as,a4}, 

and 4 2 

ag 


Qe) = [(2) + (-1)] 
= (4) oi -4 


Note that the net output at a ver- ° 

tex remains the same if we reverse 

the direction of an are and change -7 a =! 
a 2 


the sign of its flow. We could, i 
therefore, make all are flows non- Figure 7-1 
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negative without disturbing the net output at any vertex. However, it 
is more useful here to let the are directions remain unchanged, thus pro- 
viding a fixed frame of reference, and to allow negative flows in certain 
situations. 

We now group the vertices of N into sets V+, V-, and V° as follows: 


V+ = {ve VIQ(v) > 0} 
V- = {ve VIQ(r) < 0} 
Vo = {ve VIQ@) = 0} 


The elements of V+, V~-, and V° will be called sources, sinks, and inter- 
mediate vertices, respectively. Intuitively, these vertices respectively 
create, consume, and conserve flow. 

The network as a whole conserves flow, relative to any flow ¢, in 


the sense that 
y Qv) = 0 
This can be seen by writing 


YeM=F FY e@- P y e@ 
ve reV aeto—V) veV ae( V0) 
and noting that every are of the network appears precisely once in each 
of the double summations. In Fig. 7-2, for example, we have the follow- 
ing classification of vertices given in the accompanying table. If there 
is a single source v; and a single sink v,, the flow is said to be from v; to vj, 
and Q(v,), or, equivalently, —Q(v;), is called the value of the flow. [For 
convenience, this terminology will be extended to include flows of value 
zero, for which Q(v) = 0 for all vertices.] 


i | Qui) Type 

1 5 | source 

2 6 | source 

3 0 | intermediate 
4 0 | intermediate 
5 —10 | sink 

6 —1 | sink 


Any flow in a network can be transformed into a flow having a single 
source and sink by augmenting the structure of the network. Specific- 
ally, a new vertex w; is added, together with an are bj ~ (w,,v;) leading 
from w,; to each source %;. Such an arc is assigned the flow 


(bi) = Qe) 
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Figure 7-3 


Similarly, a second vertex we is added, together with an are c; ~ (v;,w») 
from each sink v; to wz, having flow 


o(c;) = —Q(r) 


To illustrate this, Fig. 7-3 shows the augmentation required to put the 
flow of Fig. 7-2 into this standard form. The resulting flow is a flow 
from w, to we having value 11. 

Most of the following material is related specifically to flows having 
a unique source and sink. However, in view of the above transforma- 
tion, the results are also applicable to flows having multiple sources 
and sinks. 


7-3 FLOW OPERATIONS AND RELATIONS 


Let ¢; and yg» be flows in the same network N = (V,A) and let p be 
an integer. Then flows ¢: + ¢2, ¢1 — ¢2, and pg are defined for every 
aeA by the relationships 


(g1 + ¢2)(a) = ¢1(a) + g2(a) 
(g1 — ¢2)(a) = ¢1(a) — g(a) 
(pe1)(@) = perla) 
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It is readily seen that 


Qe, gr + ¢2) = QQ, ¢1) + QQ, ¢2) 
Q,pe1) = PQ,¢1) 


It follows that if both g; and gs are flows from v to w, having values ky 
and ko, respectively, then ¢; + ¢» is also a flow from » to w and has value 
ki + ks. Similarly, g: — ge is a flow having value |k; — ks|, being a 
flow from v to w if ky > kz and from w to v if k; < ke. (A flow of value 
zero is considered as a flow from v to w and also from w tov.) Finally, 
pe has value |pk,|, being a flow from v to w if p > 0 and from w to v if 
pso. 


Exercise 7-1 In Fig. 7-2 assign a second flow, consisting of unit flow with each 
arc, and verify the foregoing relations. 


We shall write ¢; < ¢gs if and only if 
gi(a) < g(a) for every ae A 


The flow relations g: < ¢2, ¢1 > ¢2, and g; > ¢» are defined analogously. 
A flow ¢ is said to be bounded by ¢ and ¢» if ¢(a) lies between ¢;(a) 
and ¢2(a) for every are of the network. It is evident that a sufficient 


condition for ¢ to be bounded by ¢; and ¢» is that either 
aSeSe or gS ese 


However, it is also permissible for g to be bounded above by y; for some 
ares and by ¢» for others. 
Flows ¢; and ¢g» are said to be conformal if 


¢i(a) « g(a) > 0 for every ace A 


Expressed differently, ¢; and ¢: are conformal if there is no are a such 
that ¢i(a) and g2(a) are both nonzero and oppositely oriented. Flows 
{v1,¢2, - . - ,gn} im the same network are said to be jointly conformal 
if they are pairwise conformal. 


Exercise 7-2 Verify whether the flow assigned in Exercise 7-1 and that of Fig. 
7-2 are jointly conformal. If they are not, assign a flow that is conformal with the 
flow of Fig. 7-2. 


As we shall see shortly, any flow in a network can be conveniently 
regarded as being composed of a collection of particularly simple flows 
(defined in the next section) associated with chains and circuits in the 
network. Moreover, these simple flows are jointly conformal, i.e., they 
all utilize any given are in the same direction. This is a simple but 
important fact in many application contexts where conflicting (noncon- 
formal) sets of flows have no usefulness, and possibly no interpretation. 
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Theorem 7-1 relates the concepts of conformal flows and bounded flows, 
and it plays a basic role in later discussions. 

Theorem 7-1 IfS = {¢1,92, . . . ,¢n} isa set of jointly conformal 
flows in a network N, y is any flow in N, and T C S, then y + »» ¢ is 


bounded by y and y + >; $i. 
s 


Proof: Let a be any are. If ¢(a) > 0 fori =1, 2,...,n, then 
clearly 


v(a) < ¥(a) + ¥ ea) <¥@) + Y (a) 
Similarly, if g(a) < 0 fori = 1,2, ...,n, then 
¥(a) > ¥(a) + EO) > ¥(a) + » (a) 


Since one of these two situations applies for each a A, the result follows. 


Exercise 7-3 Let ¢: be the flow assigned in Fig. 7-2 and let ge be the flow 
assigned in Exercise 7-2. Take S = {gi,gs} andlet 7 = {yi}. Assign an arbitrary 
flow y and verify the statement of the theorem. 


7-4 SIMPLE FLOWS 


In a given network N = (V,A) let C be a set of arcs which form 
either a simple chain or a simple circuit in the associated undirected 
graph. Assume, moreover, that we orient the edges by traversing this 
simple chain or circuit in one of the two possible manners. An arc of 
C will be called normal if its direction agrees with the induced orientation 
and inverted otherwise. 

Next, let gc be defined on A as follows: 


+1 for normal ares of C 
gce(a) =} —-1 for inverted arcs of C 
0 for ares not in C 


Figure 7-4 shows a simple chain C oriented from v to w, an oriented simple 
cireuit S, and the associated functions g¢ and ¢s. 

One may easily verify that if C is a simple chain joining v to w and 
oriented from v to w, then ¢c is a flow from v to w in N whose value is 1, 
called a simple chain flow from v tow. Similarly, if S is a circuit oriented 
in one of the two possible ways, gs is a zero-valued flow in N called a simple 
circuit flow. Flows of these two types (and only such) are referred to as 
simple flows in the following pages. Our point of view will be to consider 
simple flows as the building blocks from which arbitrary flows may be 
synthesized and into which they may be resolved. 
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Figure 7-4 


7-5 AN ALTERNATIVE FLOW FORMULATION 


It is sometimes convenient to consider an alternative representation of a 
network flow in which both the network configuration and the values of 
arc flows are reflected in the structure of an associated directed graph. 
Let y denote a flow in a network N = (V,A). We construct a directed 
graph U(N,v) = (V',A’) whose vertices are in 1-1 correspondence with 
those of N. For each are ae A such that g(a) ~ 0 we include |¢(a)| 
strictly parallel ares in A’ joining the corresponding vertices. If a > (v,w) 
and g(a) > 0, these ares are taken to be ~(v’,w’) in A’. If g(a) <0, 
they are taken to be ~(w’,v’). Thus the directions of ares of U(N,) 
indicate the true directions of are flows in N, while their numbers reflect 
the magnitudes of arc flows in N. The directed graph U(N,¢) is called 
the unitary graph associated with flow ¢ in network N. Figure 7-5 
exhibits the unitary graph associated with a specific N and ¢. 

Note that U(N,¢) may not reflect the complete structure of N, since 
ares ae A for which g(a) = 0 are not represented. In particular, U(N,¢) 
may fail to be connected, and hence may not constitute a network in its 
own right. Except for this technicality, we have essentially eliminated 


Figure 7-5 
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the need for the flow function by producing a network with unit flow in 
every arc. 


Exercise 7-4 Give the unitary graph associated with the augmented network of 
Fig. 7-3 by using the flow given in Fig. 7-2. 


Tf v’ denotes the vertex in U(N,¢) corresponding to » in N, it is easily 
seen that 


atv’) — F-(v') = QW¢) 
In particular, if y is a flow of value k from »; to v;, then U(N,v) is pseudo- 
symmetric except possibly at v; and v;, and for these vertices we have 


+) — FO) = F() — Ht) =k 


Recalling Theorem 3-7, it follows that U(N,g) can be covered by k simple 
paths from v; to v;, together possibly with some simple cycles. This fact 
plays a central role in the subsequent development, in which more general 
flows are decomposed into or synthesized from appropriate simple flows. 
Theorem 7-2 is a restatement of this result in the language of flows. 
Theorem 7-2 A flow ¢ from v to w having value k can be written 


as a sum 
n 
o= da 
T 


for some appropriate n > k, where the o,’s are conformal simple flows, k 
of which are simple chain flows from v to w, the rest being simple circuit 
flows. 

We shall call such a set of conformal simple flows a decomposition of 
y. Naturally, a given flow may have more than one decomposition. In 
Fig. 7-6 two decompositions of the same flow of value 3 from v to w are 
shown. Note that one of these includes a simple circuit flow and the other 
does not. 


Exercise 7-5 Apply Theorem 7-2 to the flow in Fig. 7-2, augmented as in Fig. 
7-3, to obtain a specific set of simple flows o;. 


Figure 7-6 
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7-6 FLOWS CONSTRAINED BY ARC BOUNDS 


A network N = (V,A) is said to be bounded if two integer-valued func- 
tions a and § are defined on A and satisfy the relationship 


0 < a(a) < B(a) for every ae A 


The integers a(a) and A(a), called the lower and upper bounds on are a, 
are interpreted as representing the minimum and maximum permissible 
values of flow in each are. When a(a) = 0 for all ae A, B(a) is usually 
called the capacity of are a, and the network is said to be capacitated. A 
flow ¢ in N is said to be feasible if and only if 


a(a) < g(a) < B(a) for every aeA 


If ¢: and ¢» are feasible flows, and if ¢ is any flow bounded by ¢; and ¢», 
then clearly ¢ is also feasible. The basic questions to be considered here 
are these: 


1. Under what circumstances do feasible flows from v to w exist in a 
given bounded network? 

2. If any feasible flows exist, what is the character of the set of values 
realized by such flows? 

3. If feasible flows from v to w having value k exist, how can we find 
a specific flow of this type? 


In the remainder of this section we assume that the bounded network 
N = (V,A) under consideration has n vertices indexed as 14, v2, . . . Un 
and that feasible flows from 2; to v, are the only flows of interest (if we 
include zero-valued flows). 

As a first step toward answering the above questions, we establish 
the fact that the are bounds associated with every cut-set separating v1 
from v,, determine lower and upper bounds on the possible values of feasible 
flows. If X and Y are (not necessarily disjoint) sets of vertices, X — Y 
denotes the set of ares of the form a = (x,y), wherexe Xandye Y. Also, 
X’ denotes the complement of X relative to V. Recalling the terminology 
related to cut-sets, if X and X’ are both nonempty, then X — X’ and 
X’— X are the two directed cut-sets associated with the vertex partition 
{X,X’}. 

Theorem 7-3 If ¢ is any feasible flow from »; to v, and W is any 
set of vertices such that v; « W and v, « W’, then the value k of ¢ must 
satisfy the inequalities 


a _ <k< - a 
Ya — Fal) <k< Yala) — Fate) 


WwW’ 
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Proof: Since Q(vx) = k and Q(v) = 0 for all other ve W, we have 
ki ¥ Qo) = >; g(a) — S ¢(a) 
Ww 


woV vw 
This can be simplified to 
k= ¥ e@—- Y ea) 
ww’ wow 
Since ¢ is assumed to be feasible, the relationship 
a(a) < ¢(a) < B(a) 


holds for every arc. By appropriate substitution of these bounds in the 
expression for k, the desired inequalities are obtained. 

Note that W was arbitrary in Theorem 7-3, except for the requirement 
that ve W andv,¢« W’. If we define 


es ae oo “aap en aay 
a = nga (3, 20 ~ 2, 20) 


where W ranges over all such sets of vertices, we conclude that a necessary 
condition for the existence of any feasible flows from v; tov, is that M < M. 

Consider the network of Fig. 7-7, where the pairs of integers shown 
with certain arcs a represent the flow bounds a(a) and 8(a). Taking 
W = {v4,02,v3,04}, note that 


¥ ae@—- Y B@)=10-2=8<M 
wow 


ww 


and 


On the other hand, taking W = {v;}, we have 
Ba)—- aa) =7-1=62>M 


wow wow 


so the minimal (net) flow which 
may feasibly cross the first of these 
cut-sets is not compatible with the 
maximal flow which may feasibly 
cross the second. Consequently, 
no flow in this network can be fea- 
sible. (For this reason, the values 
of a and 8 on the two vertical ares 
are immaterial.) 


Exercise 7-6 Use Fig. 7-7 to take 
every possible partitioning of the vertices Figure 7-7 
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into W and W’ when 2: « W and »;eW’. Evaluate the summations in Theorem 7-3 
and find Mand M. (Assume that a = 8 = 3 on the two vertical arcs.) 


Naturally, in a complex network having a large number of vertices 
it would not be practical to test all partitions {W,W’} in order to evaluate 
Mand M. Besides, even if this were practical, what could we conclude 
if M < M? So far, we have only established that this inequality is nec- 
essary for the existence of feasible flows. We need a corresponding the- 
oretical result which asserts that this inequality, or something like it, is 
also sufficient for the existence of feasible flows. The remainder of this 
section is devoted to developing such a result. In the course of doing so, 
a practical procedure for realizing flows of all possible values will emerge. 

Theorem 7-4 establishes the fact that if an initial feasible flow from 
v; to v, is known, then feasible flows having all possible values can be 
obtained by adding or subtracting appropriate simple chain flows. 

Theorem 7-4 Let ¢ be a feasible flow from v; to v, having value k. 
If there exists a feasible flow y* having value k* # k, then there exist 
m = |k* — k| simple chain flows {o1,02, . . . om} from v; to vp, such that 


m 

et »; a; is a feasible flow having value k*. (The plus sign is appropriate 
1 

if k* > k; the minus sign if k* < k.) 


Proof: Suppose that k* > k. Then ¢* — ¢ is a flow from x; to vp, 
having value m. By applying Theorem 7-2, we can write 


OF = p= on es? ? te here tae 
where the o;’s and ¢,’s are conformal simple chain and cireuit flows, respec- 
tively. (The circuit flows are indefinite in number and may be absent 
altogether.) Then, by using Theorem 7-1, we find that ¢ + Si is 
bounded by ¢ and ¢ + (¢* — g) = ¢*. Being bounded by two feasible 
flows, ¢ + > a; is necessarily feasible, and it clearly has value k + m = k*. 
This inti of the case in which k* > k. The case in which k* < k can 


be treated in an analogous manner, the details of which are left to the 
reader. 


Exercise 7-7 Use the notation of Theorem 7-4 to draw a network and two flows 
gand ¢* (between the same pair of vertices) having values k = 2andk* = 5. Deter- 
mine the flow »* — ¢ and write it as the sum of conformal simple flows. 


Remark: We can now rigorously verify the not-very-surprising fact 
that the set of values (if any) realized by feasible flows is a set of consecu- 
tive integers. That is, if feasible flows having values k and k* > k exist 
and if r is an integer such that k <r < k*, then a feasible flow having 
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(b) 


Figure 7-8 


value r exists. To see this, consider the flow g + 2o;, where the summa- 
tion includes any r — k of the simple chain flows of the proof of Theorem 
7-4. This flow has value r, and it is feasible for the same reasons as in 
that proof. 

It is not only theoretically possible but also quite practical to synthe- 
size flows having prescribed values by adding (or subtracting) appropriate 
simple chain flows to some starting feasible flow. Before describing this 
process of synthesis precisely, we shall illustrate its main features infor- 
mally and use the capacitated network shown in Fig. 7-8 as an example. 
The numbers shown in Fig. 7-8a represent the are capacities, and v and 
w are the intended source and sink, respectively. It is assumed, more- 
over, that the network is symmetric, so that flow may be in either direc- 
tion in any arc, provided that it is not greater than the corresponding 
capacity. 

Our object is to maximize the flow from v to w. The maximal value 
is clearly at most 5, because of the capacities of the arcs incident with v. 
To begin, we take any path joining v and w such as the one determined by 
the vertex sequence 

v, Y, t, w 


and assign to each are of the path a flow of 2. (Any larger value would 
violate the capacity of some arcs.) Arcs not in this path are initially 
assigned a value of zero. The resulting flow is shown in Fig. 7-8b. 

We now seek any second path which avoids saturated ares [i.e., ares 
like (v,y) and (z,w) for which flow already equals capacity]. Take, for 
example, the path determined by the vertex sequence 


v, Z, Zw 
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ba 


Figure 7-9 


and assign one unit of flow to each of its ares. The resulting flow, when 
added to that of Fig. 7-8, is shown in (c). In this figure, three arcs 
(those marked with X’s) are saturated. Note that since we must avoid 
them, the only way in which we can increase the value of flow from v to 
w is to negate our earlier decision to send flow in the direction from y to z. 
To do this, we add a flow of two units in the chain determined by the 
vertex sequence 
v, 2, Y, zw 


resulting in the flow shown in (d). The effect is to retract or cancel the 
flow in the edge joining x and y and to obtain a total flow whose value is 
clearly maximal. 

We now return to a formal discussion of this approach of synthesizing 
flows by means of chain flows. The procedure is most easily described in 
terms of an auxiliary directed graph. If ¢ is a feasible flow in a network 
N = (V,A), then the associated incremental graph I(N,¢) is a directed 
graph having the same vertices as N and ares determined as follows: For 
every arc a~ (v,w) in N, I(N,¢) includes are a if g(a) < B(a) and a 
“reverse are” a’ ~ (w,v) if g(a) > a(a). If a or a’ is present in I(N,¢), 
then it is possible to add or subtract, respectively, one unit of flow in a 
without destroying feasibility in this are. Figure 7-9 (right) shows the 
incremental graph associated with the network and feasible flow shown 
in Fig. 7-9 (left). [Here the triple of numbers z, y, z associated with each 
arc represent «(a), g(a), and B(a), respectively.] 

Every simple path from »; to v, in J(N,g) determines a simple chain 
flow from v; to v, in N which can be added to ¢ without disturbing feasibil- 
ity. In Fig. 7-9 (right) the path {a1,a;,as} in J(N,¢) determines thesimple 
chain flow o in N having a, and a; as normal ares and a; as an inverted 
arc. Similarly, every simple path from v, to v; in I(N,g) determines a 
simple chain flow o* from 1; to v, such that g — o* is feasible in N. The 
presence of the path {a;,a;} indicates that the chain flow having a; and a, 
as normal ares can be subtracted from ¢ without destroying feasibility. 
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It is apparent that if ¢ is a maximal-valued feasible flow, then there 
exist no paths from »; tov, inI(N,¢). Similarly, if g has minimal possible 
value, then there exist no paths from v, to v1, except that such paths may 
exist if ¢ has value zero. [Recall that, according to our definition, the 
value of a flow is required to be nonnegative. If J(N,¢) contains a path 
from v, to v; and ¢ has value zero, then ¢ — o will be feasible for the cor- 
responding chain flow o, but it will fail to be a legitimate flow from 1 
to vp.) 

We saw earlier that a necessary condition for the existence of feasible 
flows from v, to v, is that M < M. If we let M* denote max {M,0} to 
avoid flows having negative values, Theorem 7-5 shows that * and M 
are precise bounds on values realized by feasible flows, if any feasible 
flows exist. 

Theorem 7-5 If there exist any feasible flows from v, to v,, then 
M* and M are respectively the minimal and maximal values realized by 
feasible flows. 

Proof: Let y be a feasible flow from v; to v, such that J(N,¢) contains 
no paths from v; tov». (We can always produce such a flow by starting 
with any feasible flow and adding a succession of simple chain flows until 
a point when the incremental graph has no paths from 2; to v, is reached.) 
Let W consist of v; and all vertices reachable from v, by paths. Then 
v, €W’ by assumption. Moreover, for every are in W— W’ we must 
have y(a) = §(a), and for every are in W’ + W we must have ¢(a) = a(a). 
(Why?) But we know that the value of ¢, say, k, is given by 

k= a) — = ‘a) — a(a 
ape 2 ~ Dy MO = YB) = Yat) 
Since 
ks Y B@—- Y a) 
wow wow 
for every partition {W,W’"}, it follows that this particular W must realize 
M and that the value of ¢ is M. 

An analogous line of reasoning establishes the other half of the the- 
orem. In this case, starting with any feasible flow, we can subtract 
successive chain flows until either a feasible flow of value zero is reached 
or else no path from v, to v, exists in the current incremental graph I(N,¢). 
In the former case M* = 0 and a zero-valued flow exists, so we are fin- 
ished. In the latter case, let W be the set of vertices not reachable from 
v, by pathsin J(N,g). Then ¢(a) = a(a) for arcsin W > W’, g(a) = (a) 
for arcs in W’ > W, and the value k of ¢ satisfies the relationship 


k= ¥ o@)- ¥ e@= ¥ a@)- Y BG) 


wow’ wow wow’ wow 
so a feasible flow realizing the value M* exists. This completes the proof. 
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When a(a) = 0 for every arc, feasible flows necessarily exist and 
Theorem 7-5 is known as the max-flow min-cut theorem.?2*-4 


7-7 MAXIMAL FLOWS IN CAPACITATED NETWORKS 


In a capacitated network, a starting feasible flow having minimal value, 
namely, the flow which is identically zero, is always available. Let go 
denote this flow. We can then use the incremental procedure indicated 
in the proof of Theorem 7-5 to find a succession of flows ¢1, ¢2, ¢3, . . - 
having values 1, 2, 3,.... Ultimately, we shall arrive at a flow gy 
having maximal value. According to Theorem 7-5, this will occur when 
k = M, at which time one or more cut-sets {W,W’} will be saturated, 
meaning that g(a) = 8(a) for every ae W — W’ and ¢;(a) = 0 for every 
aeW’— W. To describe an algorithm for finding a succession of feasible 
flows, let us modify slightly the incremental graph I(N,¢) described 
earlier. For every are a ~ (v,w) in N, we now assume that both a and 
a’ = (w,v) are present in I(N,g). However, we associate a length with 
every arc in (N,¢) as follows: 


_{0  ife@ <s@) 
ane | ® if g(a) = B(a) 

Wena & if g(a) > a(a) 
aan 2 if g(a) = ala) 


[Note that we could use some positive number rather than « here. How- 
ever, our convention will allow us to state the algorithm for minimizing 
cost (discussed later) with a minimum of modification.) Thus ares which 
were formerly deleted from J(N,g) are now retained, but have infinite 
length, and ¢ is maximal if there are no paths of zero length from 2 to vp. 

In the following algorithm, J denotes the associated incremental 
graph (whose structure now does not vary with ¢) and dx denotes the dis- 
tance function related to the k-valued flow ¢. 


Maximal-flow Algorithm 

1. Initially, set 7 = 0 and take as go the flow which is identically zero 
in every are. 

2. Determine the shortest distance from v; to v, in J, relative to the 
distance function ;. (This step can be taken by using the labeling 
technique described in Chap. 3.) 

3. If the distance determined in step 2 is finite, let C denote any 
simple path from v; to », with shortest length and let o, denote the cor- 
responding simple chain flow in N. Define giz: = ¢; +0. and repeat 
step 2, with 7 + 1 in place of ¢. 

4. If the shortest distance from v; to v, is ©, then ¢; is a flow having 
maximal value, and the algorithm may be terminated. 
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There is, of course, one serious drawback to this procedure, namely, 
that the value of flow is increased by only one unit for each repetition of 
step 2. In practice, we can expedite matters considerably. Having 
found a path C in step 3, we can perform a test to see how many units of 
flow may be routed this way. Expressed differently, we can determine 
the largest integer ¢ such that ¢; + to. is a feasible flow. To determine t, 
note that, for every are a ~ (v,w) such that \;(a) < », we can add up to 
B(a) — ¢i(a) units of flow to ¢,(a) without disturbing feasibility. Sim- 
ilarly, if \;(a’) < ©, a maximum of ¢,;(a) — a(a) may be subtracted from 
g(a). The integer ¢ is taken to be the minimum of these incremental 
quantities associated with ares in C, and step 3 of the algorithm is modified 
to the extent that we now define not gix1 but gis: = gi + tor. 

Exercise 7-8 Consider Fig. 7-8, for which we have already informally obtained 


a maximal-valued flow. Solve this problem formally by following the sequence of 
steps specified by the maximal-flow algorithm. 


7-8 MAXIMAL FLOWS IN GENERAL BOUNDED NETWORKS 


If a(a) > 0 for one or more ares, but a feasible flow ¢; having value 7 is 
known, the procedure described in the preceding section may still be 
employed to generate feasible flows with values increasing from i to M 
or with values decreasing from i to M*. (In this case, step 2 must be 
modified to find the shortest distance from v, to v; in the current incre- 
mental graph.) However, difficulty arises if an initial feasible flow is not 
known. Note that in this case the flow which is identically zero is specif- 
ically nonfeasible. 

To find a starting feasible flow, we define an auxiliary problem in a 
related network N’ (Ref. 5 of Chap. 1). The network N’ = (V’,A’) is 
obtained from N = (V,A) as follows: The vertex set V’ consists of the 
vertices v1, ... , ¥, of V, together with two additional vertices v) and 
Vnyi- (These will be the source and sink, respectively, of flows in the 
auxiliary network.) 

For every arc a = (v,w) « A,A’ has three corresponding arcs: 


a’ = (v,w) 
a” = (v9,w) 
al” = (v,0n41) 
These ares are assigned flow bounds as follows: 
a'(a') = a'(a") = a'(a’") = 0 
B'(a’) = B(a) — a(a) 
B’(a"’) = B’(a"”’) = aa) 
Note that for ares such that a(a) = 0, construction of a” and a’” is super- 
fluous, since a = 8 = Oforsucharcs. In practice, these arcs are omitted. 
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a, 24 


Lit a3 


ay as 


Figure 7-10 


In theory, however, it is simplest to suppose that the three arcs a’, a’’, 
and a’” are invariably included. 
One final are b’ having the following characteristics is added to N’: 


=~ (nm) a(b') =0 Bb) =K 


where XK is an integer larger than the value of any possible feasible flow in 
the original network N. Figure 7-10 shows the structure of the auxiliary 
network N’ associated with a given network N. 

Remark: It should be noted that N’ is completely determined by NV 
and its constraining functions a and 8. If N is a complex network, it 
would clearly be very troublesome, at best, to actually construct N’. 
However, a digital computer can be used to good advantage to construct 
the auxiliary network and to implement the computational algorithm. 


Exercise 7-9 In Fig. 7-7 complete the specification of « and 8 by setting a = 1 
and 8 = 3 for each of the two vertical arcs. Construct the auxiliary network N’ by 
taking », as source and vs as sink. Indicate the value of a’ and §’ for every are of N’. 


The importance of N’ stems from two facts. First, N’ isa capacitated 
network, and the preceding section provides a means for finding a max- 
imal flow in such a network. Second, we can show that a maximal flow 
g’ in N’ can be easily transformed into a feasible flow ¢ in N, provided 
that any feasible flows exist. If none exist, this too can be easily deduced 
from the characteristics of y’. 

A feasible flow ¢’ from vo to vps; in N’ is called a saturating flow if 
¢’(a”’) = B'(a") for every arc a” from vo. Such a flow (if it exists) is 
necessarily maximal in N’. (Why?) An equivalent characterization is 
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that ¢’(a’”) = 6’(a’”) for every are terminating at rn41. If ¢’ is a sat- 
urating flow in N’, consider the following function ¢ defined on the ares 
of N: 

(a) = ¢'(a’) + a(a) 


With ¢ defined in this manner, one can verify the following theorem: 
Theorem 7-6 If ¢’ is a saturating flow in N’ and ¢ is defined as 
above, then ¢ is a feasible flow from v; to v, in N. Moreover, the value 
of gis ¢'(b’). 
Exercise 7-10 Prove Theorem 7-6. Note that three things must be established: 
1. (a) < g(a) < Bla) for all a 
2. Avie) =0 fori = 2,3,...,n—1 
3. Qivie) = o'(') 


So if, in maximizing flow through N’, we obtain a saturating flow ¢’, 
we can easily obtain a feasible flow gin N. On the other hand, if a max- 
imal flow in N’ fails to be a saturating flow, Theorem 7-7 asserts that no 
feasible flows in N exist. 

Theorem 7-7 If ¢’ is a maximal flow from v9 to vp41 in N’ but is 
not a saturating flow, then no feasible flows from v, to »v, exist in N. 

Proof: Let W consist of those vertices, other than vo, which can be 
reached from v» by paths of finite length in the incremental graph I(N’,¢’). 
Let W’ denote the vertices not reachable, other than v,;1. Since ¢’ is 
not a saturating flow, both W and W’ are nonempty. (Verify this in 
detail.) The capacity (i.e., sum of upper flow bounds) of the cut-set 
W — W’ determined by the partitioning of V’ into W = {vo} U W and 
W’ = {vnyi} U W’ is given by 

by Ba”) + y B'(a’) + > B’(a’”’) 
ww! wow Wins 

By using the fact that 6’ = 8 — a for arcs of type a’, while p’ = a 
for ares of types a” and a’”, k can be written as 


k= Y a@)+ Y a@+ ¥ p@— YF ala) 


wow’ wow wow wow 


+ Y aa + Y ala) 


wow ww’ 


Since ¢’ is not a saturating flow, we also have 


b> Yala) + FY ala + FY al) + Yala) 


wow wow’ wow 


Upon combining the last two relationships, we find that 


> B(a) — y a(a) <0 


ww Www 
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It follows that for any feasible flow in N we must have 


X e@- Y e@)<o 
wow wow 
(Why?) On the other hand, if x; and v, are both in W or both in W’, 
any flow ¢ from »; to v, must satisfy the equation 


> -@- Y e@=0 


ww wow 
while if v; ¢ W and v, « W’, ¢ must satisfy the equation 


Y e@- Y e@=k>0 

wow wow 
where k is the value of y. We conclude that no feasible flows from v, to 
v, in N exist, except possibly for the case not considered: when v, « W and 
v,eW’. But this case never applies, because if we can reach v, via an 
unsaturated path in N’, we can also reach 1, since the arc b' is never 

saturated. This completes the proof. 
Exercise 7-11 Apply the maximal-flow algorithm to the capacitated network N’ 


constructed in Exercise 7-9. Show that the maximal flow is not a saturating flow 
and hence according to Theorem 7-7 no feasible flows in N exist. 


Recapitulation 


To summarize the principal findings thus far in the chapter, we have 
seen that we can determine a maximal flow in a capacitated network by 
starting with the identically zero flow and adding a succession of simple 
chain flows from source to sink. The last flow in the sequence has max- 
imal value, and this is detected when, in the associated incremental graph, 
no further paths having finite length join v; to v,. Every intermediate 
flow determined during this procedure is also feasible, so we have in fact 
generated feasible flows realizing all possible values. Moreover, it is 
possible to accelerate the procedure by taking appropriate multiples of 
each chain flow. For other networks, for which a(a) > 0 for some or all 
ares, it is necessary first to construct an associated capacitated network 
N’ and maximize flow from vo to p41 in the above manner. The maximal 
flow ¢’ then either leads to a corresponding feasible flow ¢ in N or else 
definitely indicates that no such flows exist. In the former case ¢ can be 
taken as the starting point for a sequence of increasing (decreasing) flows 
each obtained from the preceding one by adding (subtracting) an appro- 
priate chain flow. The same device may be employed to accelerate find- 
ing the maximal (minimal) flow, once ¢ has been determined. Thus, for 
any bounded network, we have a practical means for generating feasible 
flows between a fixed pair of vertices having all possible flow values. 
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7-9 MINIMAL-COST FLOWS 


If N is a bounded network and M* <k < M, there are generally 
numerous feasible flows from v; to v, having value k. In that case it is 
frequently desirable to select from among them one which minimizes a 
quantitative measure which we shall call ‘‘cost”’” but whose true interpreta- 
tion varies with the application. 

We continue to assume that N = (V,A) is a given network with are 
bounds a(a) and B(a). In addition, we now assume that a real-valued 
cost function y is associated with N, which assigns to each arc ae A a unit 
cost y(a) > 0. If ¢ is any feasible flow, the total cost of ¢ is denoted by 
y(¢) and defined by the relationship 

v(¢) = ¥ (a) - e(a) 
acd 

If a= (v,w), y(a) can be interpreted as the cost associated with 
sending one unit of flow from v to w through a. Then y(¢) is merely the 
total cost of flowin the network. Itisimportant to note that this formula- 
tion is an appropriate model for evaluating the cost of flows only when the 
cost of flow in each are can be assumed proportional to the magnitude of 
flow. 

For the moment, we shall restrict attention to capacitated networks 
(for which a = 0). The basic problem we shall consider is this: given a 
capacitated network N and a cost function y, how can we find a feasible 
flow ¢ from x; to v, whose value is a prescribed integer k and whose cost is 
minimal? 

Since y(¢) > 0 for every feasible flow ¢, the feasible flow ¢o which is 
identically zero is clearly a minimal-cost feasible flow having value zero. 
Moreover, we shall see in this section that if ¢, is a feasible flow of value 
k whose cost is minimal, we can find an appropriate chain flow o such that 
¢k41 = ¢ +o is a minimal-cost feasible flow of value k + 1 (except, of 
course, if g is maximal). This being the case, we can start with go and 
generate a succession of minimal-cost flows ¢1, ¢2, . . . until a flow having 
the desired value k is reached. For an alternate procedure based on 
linear programming concepts, see Fulkerson.? 

If ¢ is a feasible flow from v; to v, having value k, let 1(N,¢) denote 
the associated incremental graph. For each arc a in the original network, 
define a length \(a) as follows: 


(a) = (a) if g(a) < Ba) 
Xa) = & if (a) = B(a) 


Thus \(a) represents the unit cost of a, with the convention that the unit 
cost is infinite if a is already saturated. 
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Similarly, for each are a’ in I(N,¢) which corresponds to the ‘‘reverse”’ 
of an arc a in N, define \(a’) by 


Xa’) = —y(a) if g(a) > 0 
Aa’) = @ if g(a) = 0 


If P is a simple path from x; to v, or a simple cycle in J(N,¢), then 
MP) = » (a) is readily seen to be the incremental cost associated with 


adding to ¢ the simple chain or cycle flow cp determined by P. Expressed 
symbolically, 


v(¢ + or) = y(v) + MP) 


This will be true even for infeasible flows y + op if we adopt the conven- 
tion that all infeasible flows have infinite cost. 

Assuming that ¢ is not maximal, then a shortest path Po» from 2; to 
v, in I(N,¢) clearly determines the best possible chain flow with which to 
augment ¢; since if \(Po) < A(P), then 


av(¢ + op,) = v(¢) + Po) < v(v) + MP) = v(¢ + oP) 


There is one difficulty, however: it may happen that \(Z) < 0 for some 
simple cycles Z in I(N,¢). We saw in Chap. 3 that our technique for 
finding shortest simple paths through a directed graph breaks down in 
this case. Fortunately, if ¢ is a minimal-cost flow, this cannot happen. 
In fact, Theorem 7-8 characterizes minimal-cost flows in terms of cycles of 
negative length. 

Theorem 7-8 A feasible flow ¢ having value k minimizes cost over 
all feasible flows of value k if and only if \(Z) > 0 for every simple cycle 
Zin I(N,g). 

Proof: If \(Z) < 0 for some simple cycle Z, then the corresponding 
simple circuit flow ¢, is such that 


y(¢ + 22) = ¥(¢) + Moz) < r(¢) 


and hence ¢ is not a minimal-cost flow. So if ¢ minimizes cost, then 
(Z) = 0 for every cycle Z. 

To prove the converse, assume that \(Z) > 0 for every cycle Z but 
that ¢ is not a minimal-cost flow. (We shall arrive at a contradiction.) 
Let ¢’ be a minimal-cost flow having value k. Then we can write 


eg —eg=atort::* tom 


where the o;’s are conformal circuit flows associated with cycles Z;, Zs, 
...,ZminI(N,g). Relative to the distance function \ associated with 
I(N,g), we have (by assumption) 


MZ) > 0 
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and hence 
ve + 01) = v(y) + MZ:) = r(¢) 


Now let \; denote the distance function associated with 7(N, 9 + 0:). 
Clearly, \(a) and \,(a) are equal on ares not members of Z,, since their 
flow is not altered. For ares in Z, which are in N, \,(a) = © or else 
(a) = (a), depending on whether ais now saturated or not. Similarly, 
for ares a’ in Z; which are the “reverse” of arcs in N, we have \,(a’) = © 
or \,(a’) = (a’), depending on whether arc a is now empty or not. The 
only types of arcs in the incremental graph for which \; < \ are those 
which are the opposites of arcs in Z, (that is, arc a if a’ e Z and arc a’ if 
aeZ,). But since the o;’s are conformal, such ares cannot appear in any 
of the remaining Z;’s, and their new lengths are irrelevant in the remaining 
argument. 
As a result of the above relationship between \; and \, we have 


Mi(Z2) > A(Z2) > 0 
and so 
y(e + 01 +02) = y(y + 01) + (Za) > (¢) 


By repeating this process m times, we eventually find that 


vet+tot+ +++ tom) > ¥(¢) 
But since 
etoats + tom=¢' 


this contradicts the assumption that 


ve’) < v(¢) 
and completes the proof. 
Exercise 7-12 Prove that if ¢ is a minimal-cost feasible flow in N, then there 


exists at least one other feasible flow ¢’ in N having the same value and also minimizing 
cost if and only if y(C) = 0 for some cycle in I(N,¢). 


In a capacitated network, the statement of an algorithm which will 
find minimal-cost feasible flows of all possible values from 0 to M is for- 
mally the same as the maximal-flow algorithm of Sec. 7-7, with the under- 
standing that now the appropriate length function \ to use at each stage 
is the one defined here in terms of incremental are costs. 

Figure 7-11 illustrates the algorithm for finding minimal-cost flows 
for a very small network. After studying this example, the reader can 
best gain real familiarity with the procedure by applying the algorithm to 
various networks of a somewhat more complex variety. For very complex 
networks, manual implementation becomes impractical. However, net- 
works with several thousand ares can be readily handled by means of a 
digital computer. 
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The top network in Fig. 7-11 shows the structure of the network and 
gives the unit costs and capacities of all arcs. The left-hand column of 
graphs is the succession of incremental graphs which must be considered. 
The numbers associated with the arcs are their lengths, arcs with infinite 
length being omitted, and the arcs of a shortest path from source to sink 
are shown as bold ares. The right-hand column of graphs shows the 
succession of minimal-cost flows, of increasing value, obtained by succes- 
sively adding the chain flows associated with the shortest paths in the 
incremental graphs. The numbers associated with the arcs are the cor- 
responding are flows. In all, four chain flows, which had associated 
values of 5,2,3, and 1, respectively, were found. The final flow of value 
11 is clearly maximal, since both the ares leading into the sink are 
saturated. 

Note that the first incremental graph J(N,¢o) is structurally identical 
with the given network. (Here ¢gois, of course, the flow which is identically 
zero in all ares.) This is the case because no arcs are yet saturated and 
no positive flows which could potentially be canceled exist. All of the 
successive incremental graphs have certain forward (solid) arcs missing, 
corresponding to arcs which are saturated. In addition, they have certain 
“reverse” (dashed) ares, corresponding to ares having positive flow, which 
could be canceled—by using the are in the reverse direction—without 
destroying feasibility. 

The simple nature of this example made it possible to find a shortest 
path through each incremental graph by inspection. In a more complex 
case, one must use a systematic technique, such as that of Chap. 3, to find 
each shortest path, since selection of a non-shortest path will result in a 
flow whose cost is not minimal and which is not a valid starting point for 
determining the next increment of flow. 

Note that the last chain flow does in fact involve a “reverse” are of 
the incremental graph J(N,¢10), so that the optimal flow of value 11 uses 
this are to a lesser extent than the optimal flow of value 10. 

In this example, there happened to be a unique shortest path in each 
successive incremental graph. It should be noted that this is not always 
the case. When there are several shortest paths, any one of these may be 
used as the basis for the next chain flow. 


Exercise 7-13 Referring to the preceding example, suppose that we start with 
the flow ¢; of value 3 obtained by assigning three units of flow to the bottom three arcs 
and Oto the rest. Construct the corresponding incremental graph I(N,¢s) and observe 
that it contains at least one cycle whose length is negative. (The theory has shown 
that this must be the case, since y; is not a minimal-cost flow of value 3.) 


It is important to note that, while we have bypassed flows of values 
1, 2, 3, 4, 6, 8, and 9 in the above example, we can produce minimal-cost 
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flows having these values by interpolation. For example, to obtain an 
optimal flow of value 8, we would add to ¢; one unit of flow in the chain 
used to produce ¢yo. 


Exercise 7-14 In Fig. 7-8a, direct each are from left to right, assume that the 
number associated with each arc represents its unit cost (a) as well as its capacity 
B(a), and assume that a(a) = 0, that is, the network is capacitated. Use the method 
of this section to find a sequence of minimal-cost flows the last of which has maximal 
value. 


7-10 SOME SPECIALLY STRUCTURED FLOW PROBLEMS 


As a first example of a class of problems that can be treated as flow prob- 
lems, we consider the so-called transportation problem, which is much 
studied in the literature of operations research. 

Let Si, ..., Sm and T;,..., 7, be a number of origins and 
destinations, respectively, between which some homogeneous substance 
must be distributed. Each origin S; has a limited supply A;, and each 
destination 7; has a specified demand B;. Moreover, there is a cost of 
U,; per unit associated with supplying 7'; from S;. In the simplest instance 
there is no restriction on the amount shipped from 8; to T;—except, of 
course, that it must not violate the constraints imposed by A; and B;. 
The problem is to distribute the material from the sources to the destina- 
tions at minimal cost, without oversupplying any destination or exceeding 
the capability of any origin. 

This problem can be considered as the problem of finding a minimal- 
cost flow of maximal value in the network of Fig. 7-12. [The ordered pair 
(x,y) associated with each are has the interpretation that « denotes the 
capacity and y the unit cost.) If movement from S; to 7, is not permis- 
sible for certain pairs 7, 7, the corresponding ares are merely omitted. 
Similarly, if there is an upper limit on the quantity shipped from S, to 7; 
a finite capacity is associated with the appropriate arc. In this case, of 


Figure 7-12 
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course, it may happen that not all destination demands can be met. How- 
ever, the solution to the flow problem will move the maximum amount 
possible, and it will do so at minimum cost. 

If m = nand if the A’s and B’s represent individuals and jobs, respec- 
tively, then the same model is appropriate for the basic assignment prob- 
lem, in which m men are to be assigned to the m jobs in such a way that the 
optimal utilization is achieved. (In this case U;; is a measure of the value 
of assigning the 7th man to the jth job.) The capacities A; and B; are set 
to unity, so that each man is assigned to at most one job and each job is 
performed by one man. If certain men are not qualified for certain jobs, 
the corresponding arcs are omitted. Here again, this may result in a solu- 
tion in which not all jobs are performed. (In this problem, one can negate 
are values, add a large constant to each, and minimize.) 

For bipartite graphs the maximal matching problem discussed in 
Chap. 6 can also be regarded as a flow problem in a network structured as 
in Fig. 7-12. In this case, all ares have unit capacity, costs are not a 
factor, and the problem is one of maximizing flow from source to sink. 
Note that the problem of increasing flow by finding a path in the incre- 
mental graph is really the same as that of finding an alternating chain 
joining two exposed vertices, as described in Sec. 6-14. 


Exercise 7-15 In Fig. 7-12 assume that m = n = 8, that every S; is joined to 
every 7’; by an arc, that each A; = 1 and each B; = 2, and that Ui; =i + 7. 
Find a minimal-cost feasible flow from source to sink having maximal value. 


7-11 MULTIPLE - COMMODITY FLOW PROBLEMS 


Until now, we have assumed that a single homogeneous substance or 
“commodity” is flowing in the network under consideration. If the net- 
work has several sources and/or sinks, we have assumed that the solution 
may join any source with any sink via a chain flow. Suppose, however, 
that we now have K commodities and that the vertices of the network 
include 2K distinguished vertices y;, z:, for? = 1,2, . . . , k, such that we 
seek k simultaneous flows each of which is a flow from y; to 2 for some 7 
and has a given value Q;. 

To distinguish between commodities, let f;;* denote the flow of the 
kth commodity from vertex v; to vertex v;, with the added convention that 
fit = —fou*. Let Cy; denote the capacity (in either direction) of the are 
joining vertices v; and »;. 

The problem, then, is to assign an integer f;,* to every are (7,7) for 
every commodity k in such a way that, for any fixed k, the set of arc flows 


FF = (foh 
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is a flow of value Q, from y; to 2; and such that, for each are (7,7), we have 
2fF| < Cy 


(It may happen that different commodities must flow in opposite direc- 
tions in the same are in order to achieve a solution.) 

For the two-commodity case (i.e., when K = 2), Hu® has obtained a 
theoretical result, analogous to the max-flow min-cut theorem, and also a 
computational procedure for solving the above problem. To illustrate 
the procedure; consider the network of Fig. 7-13. It is assumed here that 
each are has (symmetric) capacity 2, and that 


Qi = Q2 = 2 


We first disregard commodity 2 and seek any feasible flow of value Q; = 
of commodity 1 from y; to 2:, using the single-commodity techniques 
developed earlier. We might, for example, obtain the flow shown in Fig. 
7-14, 

If no such flow exists, the problem clearly has no solution. If, as in 
this case, one does exist, we next reduce all are capacities by the amount 


\fa'l 


and repeat the process for commodity 2 alone. In this example, the 
reduced capacities are shown in Fig. 7-15. Clearly no flow of commodity 
2 is possible, since the ares incident with source y2 have zero residual 
capacity. We conclude that we must first reroute the flow of commodity 
1 before we can accommodate any flow of commodity 2. 

We now give Hu’s technique for achieving a suitable rerouting. We 
first find (if possible) some chain C,, oriented from y2 to z2, which has the 


Cy 
uy 29 29 
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Figure 7-17 Figure 7-18 


property that at least one unit of commodity 1 may be added to each are 
in the direction induced by the chain orientation. We next find (if pos- 
sible) some chain C2, oriented from z2 to ye, having the same formal 
property. (C, and C2 may involve some of the same arcs.) Appropriate 
chains for our example are shown in Fig. 7-16. Then we increase the flow 
of commodity 1, in both chains, by one unit. (In practice, larger flow 
increments may be used in some cases.) This results in the adjusted flow 
of commodity 1 as shown in Fig. 7-17. Finally, we increase the flow of 
commodity 2 along C; and along the reverse of C2 by one unit and obtain 
the (dashed) flows shown in Fig. 7-18. 

In the present example we are through, since we have achieved simul- 
taneous flows, of value 2, of both commodities. If the flow of commodity 
2 were still less than Q2, we would proceed to find another pair of chains 
C, and C, and a corresponding rerouting of commodity 1 and increase in 
flow of commodity 2. It can be shown that we can continue to find pairs 
of chains and ultimately increase the value of flow of commodity 2 to Qs 
(while holding the value of flow of commodity 1 at Q:) unless the problem 
has no solution. 

Let A, and A, denote, respectively, the capacities of minimal-capacity 
cut-sets separating y; from z; and y2 from za. Also, let Ai2 be the capacity 
of a minimal-capacity disconnecting set which meets all chains joining 
yi and z; and also all chains joining y2 and zz. Then the two-commodity 
analog of the max-flow min-cut theorem is as follows: 

Theorem 7-9 There exist jointly feasible flows F; and F, from y; 
to z, and from y2 to z2, respectively, having values Q; and Qs, respectively, 
if and only if the following three conditions are all satisfied: 


QS4A1 QS Ar Q+Q:5 An 


Necessity of the conditions is clear. For sufficiency, see Ref. 9. 


7-12 STOCHASTIC FLOW IN NETWORKS 


In this section we give a brief summary of ideas which pertain to network 
flows but have their origin in the theory of queues. 
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Figure 7-19 Figure 7-20 


Consider a network whose ares have capacities prescribed according 
to given probability distributions that are functions of time (i.e., stochastic 
processes), and let it be desired to pass flow from the source to the sink 
of such a network. The problem is to determine the total average flow 
at any time through the network. In general, the amount of flow present 
at the source is also given according to a stochastic process. Thus at 
any given moment there may or may not be any flow passing through the 
source to an initial vertex of an are. If the amount of flow available 
exceeds the capacity of an are through which it must pass, it is delayed at 
the initial vertex of that arc. 

A queueing network consists of sets of interconnected specialized 
service channels connected in series and in parallel. Before each service 
channel (or set of service channels if they cooperate in parallel) there is a 
waiting line (whose length may be zero) of items ready to be served at the 
service channels. The output from a server may comprise the input of 
another. If we associate a vertex with the arrivals to a queue and another 
with its departures and if we join these vertices with an are corresponding 
to the service, we obtain a graph. The graph is simple if each queue 
consists of a single channel and a single line. If there are several channels 
in parallel, all with the same waiting line and all leading to a single output, 
we have a figure of the form of Fig. 7-19 to represent that part of the net- 
work. If, on the other hand, each of a set of parallel channels has its 
waiting line and all channels lead to a single output, we have a figure 
of the form of Fig. 7-20. The situation in which some of the parallel 
channels may lead to other channels in series or to sinks can be repre- 
sented by a figure of the form of Fig. 7-21. For the source vertex we take 
the local population in need of service and for the sink vertex the same 
population (with satisfied demand). 

In the theory of queues the flow consists of discrete items, e.g., 
customers; whereas in the usual network flow problems flow is considered 
as continuous, but more frequently one is interested in integer-valued 
flow which can be interpreted as discrete flow. Note that in the sto- 
chastic are capacity problem there are two alternatives. Either the 
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Figure 7-21 


flow is lost at the initial vertex when the are is entirely occupied with 
flow in process or it is delayed there awaiting passage. An interesting 
theorem for discrete flow applies to queueing networks in which flow 
has been in process for a long time and one is interested in the steady 
state, i.e., in the asymptotic behavior as the time t— ». 

In a certain queueing network flow which is common to a number of 
operations, arrivals are observed to occur at random according to the 
Poisson process 

(At)"e-™ 


oF Fm. Osby Bs. scars 


An important property of this process is that the probability of more than 
a single arrival in a small period of time is negligible compared with the 
probability of no arrivals or of one arrival. 

The times between arrivals which occur according to a Poisson 
process have an exponential distribution of the form \e-“. Thus queue- 
ing problems in which arrivals occur according to a Poisson process and 
enter service according to another such process may also be studied by 
considering the exponentially distributed interarrival intervals and serv- 
ice time intervals." 

If the are has exponential capacity (exponential times of service) 
and if the arrival intervals are exponential, the flow from the exit has 
exponential distribution. The requirement here is that the ratio of the 
input rate to the are passage rate (service rate) must not exceed unity; 
otherwise, flows will be delayed for infinitely long time at the initial 
vertices. With this theorem it is possible to trace flow through a net- 
work, because the output distribution from one terminal vertex is auto- 
matically a known input distribution with the given terminal vertex as 
an initial vertex of the next stage. 

An alternative representation of a queueing network is to associate a 
vertex with the waiting line and a second vertex with the service and 
let the are incident with the two vertices indicate a transition from wait- 
ing into service. One can then associate a third vertex with the depart- 
ing stream of customers and let the are incident with this vertex and the 
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service vertex represent the transitions from service to departure or to 
the next waiting line. Thus we can classify the vertices according to 
whether they are service vertices, waiting-line vertices, or departure 
vertices. 


Exercise 7-16 Interpret both of these representations for a network of queues in 
parallel and in series and draw the diagrams. 


Exercise 7-17 Mention ten possible conceptual ramifications of network flows 
such as stochastic flow, flow through porous pipes, and through deformable ares, 


More comprehensive expositions of network flow theory, from various 
points of view, are to be found in Refs. 5, 10, 12, and 14 of Chap. 1. The 
reader is also referred to the selected list of papers at the end of this 
chapter and to the extensive bibliographies contained in the above books. 
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GLOSSARY 


The following glossary presents brief, informal definitions of those terms which are 
basic and which recur frequently in the text. 


Aeyelic graph A directed graph having no cyeles. 

Adjacency matrix Same as Vertex matrix. 

Adjacent edges or Ares Two edges or ares having at least one common end 
point. 

Adjacent vertices Two vertices that are joined by an edge or arc. 

Are See Directed graph. 

Are progression A finite sequence of (not necessarily distinct) ares such that 
the terminal vertex of each arc (except the last) is the same as the initial vertex of 
the succeeding arc. The progression is said to be closed if the initial vertex of the 
first are is the same vertex as the terminal vertex of the last arc, and open otherwise. 

Articulation point A vertex of a connected graph whose removal disconnects 
the graph. 

Asymmetric graph A directed graph that has no are from w to » if it has an 
are from v to w. 

Bipartite graph A graph whose vertices can be partitioned into two sets in 
such a way that every edge has an end point in each set. 

Branch An edge of G which is contained in a specified tree T that is a subgraph 
of G; said to be a branch relative to 7. 

Capacitated network A network in which flow in each are is constrained to 
lie between zero and an upper bound called the capacity of the are. 

Chain A set of edges which, if properly ordered, form a chain progression. In 
a geometric graph, a set of edges which form an open curve. 

Chain progression An open edge progression having no repeated edges. 

Chord An edge of @ which is not contained in a tree 7 that is a subgraph of G; 
said to be a chord relative to 7. 

Chromatic number of a graph The smallest k such that the vertices of the 
graph can be partitioned into k sets each of which is an independent set of vertices. 

Circuit A set of edges which, if properly ordered, form a circuit progression. 
In a geometric graph, a set of edges which form a closed curve. 

Cireuit matrix A matrix whose rows and columns correspond to the simple 
cireuits and edges of a graph, respectively, and each of whose elements is 1 or 0, 
depending on whether or not the corresponding circuit contains the corresponding 
edge. 

Circuit progression A closed edge progression having no repeated edges. 

Complementary graph A simple graph G obtained by deleting the edges of a 
given simple graph G from a complete graph having the same vertices. 

Complete graph One in which every two distinct vertices are adjacent. 
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Component of a graph A connected subgraph which is not contained in any 
larger connected subgraph. 

Connected graph A graph such that every pair of distinct vertices are joined 
by at least one chain. 

Cut-set A disconnecting set consisting of all the edges that join a specified set 
of vertices with the complementary set of vertices. 

Cut-set matrix A matrix analogous to a circuit matrix, with proper cut-sets 
replacing simple circuits. 

Cyele A set of ares which, if properly ordered, form a cycle progression. In 
a geometric graph, a closed curve composed of consistently directed arcs. 

Cycle progression A closed are progression having no repeated vertices. 

Cyelic graph A directed graph having at least one cycle. 

Degree of a vertex The number of edges (or arcs) with which the vertex is 
incident, with loops counted twice. 

Directed Euler graph A directed graph such that the negative degree and 
positive degree of each vertex are equal. 

Directed graph A mathematical system consisting of two sets V and A, 
together with a mapping A of A into V X V. The elements of V and A are called 
the vertices and arcs of the directed graph, respectively, and A is called its directed 
incidence mapping. If A(a) = (v,w), v is called the initial verter of arc a and w is 
called its terminal vertex. 

Directed incidence mapping See Directed graph. 

Disconnecting set A set of edges of a connected graph whose removal dis- 
connects the graph. 

Dominating set of vertices A set W of vertices such that every vertex not 
in W is adjacent to a vertex in W. 

Edge See Graph. 

Edge progression A finite sequence of (not necessarily distinct) edges such 
that one end point of the first edge is also an end point of the second, the remaining 
end point of the second is also an end point of the third, etc. The edge progression 
is closed if the “free” end point of the first edge is the same vertex as the “free” end 
point of the last, and open otherwise. 

End points The vertices with which an edge or are is incident. 

Euler graph A graph having no vertices of odd degree. 

Finite graph A graph having a finite number of vertices and edges (or arcs, 
in the case of a directed graph). 

Flow An assignment of integers to the arcs of a network, these integers being 
interpreted as the rates of flow of a substance through the ares. 

Forest A graph that has no circuits, so that each component is a tree. 

Geometric graph A graph whose vertices are selected points in two- or three- 
dimensional space and whose edges are nonintersecting simple curves each of which 
joins two vertices (or, in the case of a loop, closes on a single vertex) without con- 
taining any other vertices. 

Geometric realization A geometric graph that is isomorphic with a given 
graph. 

Graph A mathematical system consisting of two sets V and £, together with 
a mapping I of Z into V & V (the set of unordered pairs of elements of V). The 
elements of V and E are called the vertices and edges of the graph, respectively, and 
is called its incidence mapping. If T'(e) = (v & w), vertices v and w are called the 
end points of edge e. 
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Graph of a binary relation A directed graph whose vertices correspond to 
the elements of the set in which the relation is defined and whose arcs join precisely 
those pairs of vertices which are in the relation. 

Hamiltonian chain, circuit, cycle, or path A chain, circuit, cycle, or path 
that is simple and that is incident with every vertex of the graph under consideration. 

Inarticulate graph A connected graph having no articulation points. 

Incidence The relationship between an edge (or arc) e and the vertices into 
which it is mapped by the incidence mapping of the graph. 

Incidence mapping See Graph, 

Incidence matrix A matrix whose rows and columns correspond to the vertices 
and edges of a graph, respectively. Each element is 1 or 0, depending on whether or 
not the corresponding vertex and edge are incident. 

Independent set of vertices A set of vertices no two of which are adjacent. 

Initial vertex See Directed graph. 

Isomorphic graphs A pair of graphs whose vertices and edges (or ares) can 
be placed in 1-1 correspondence in such a way that incidences are preserved. In the 
directed case, the sense of incidence (i.e., positive or negative) must also be preserved. 

Loop An edge (or are) that is incident with only one vertex. 

Matching A set of edges of a graph such that no two edges in the set are 
adjacent. 

Negative degree of a vertex The number of ares with which the vertex is 
negatively incident. 

Negative incidence The relationship between an are and its terminal vertex. 

Network A directed graph which is connected and has no loops. This term is 
applied in the context of flow theory. 

Parallel edges or Ares. Edges or arcs having the same end points. 

Path A set of arcs which, if properly ordered, form a path progression. In a 
geometric graph, an open curve composed of consistently directed ares. 

Path progression An open are progression having no repeated ares. 

PERT network A directed graph, used to characterize the interactions of 
individual activities within an overall complex project. The ares correspond to the 
activities, and their end points are chosen in a way which reflects constraints on the 
order in which the activities can be performed. 

Planar graph A graph that is isomorphic with a geometric graph in a plane. 

Positive degree of a vertex The number of arcs with which the vertex is 
positively incident. 

Positive incidence The relationship between an arc and its initial vertex. 

Primitive graph A directed graph such that, for a suitable integer k, every 
pair of distinct vertices can be joined by an are progression having precisely k terms. 

Proper cut-set A cut-set which does not contain any proper subset which is 
a cut-set. 

Reflexive graph A directed graph having a loop incident with each vertex. 

Regular graph A graph all of whose vertices have the same degree. 

Rooted directed tree A directed graph which is a tree and which has a vertex 
v such that, for every other vertex w, the unique tree chain joining v and w is in fact 
@ path directed from » to w; the tree is said to be rooted at the vertex v. 

Simple chain A chain having at most two edges incident with any given 
vertex. 

Simple circuit A circuit having at most two edges incident with any given 
vertex. 
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Simple cycle A cycle having at most two ares incident with any given vertex. 

Simple graph An undirected graph that has no parallel edges and no loops; 
a directed graph that has no strictly parallel arcs and no loops. 

Simple path A path having at most two ares incident with any given vertex. 

Sink A vertex in a network at which flow input exceeds output. 

Source A vertex in a network at which flow output exceeds input. 

Spanning tree A subgraph of a connected graph G which is a tree and which 
includes all vertices of G. 

Strictly parallel ares Arcs having the same initial vertex and the same 
terminal vertex. 

Strongly connected graph A directed graph for every ordered pair of distinct 
vertices (v,w) of which there is at least one path from »v to w. 

Subgraph A graph G’ = (V’,E’,T’) so related to a graph G = (V,H,T) that 
Vv’ CV, E’ CE, and r’(e) = P(e) for every edge e « EL’; the first graph being said to 
be a subgraph of the second. It is necessary that V’ include all end points (relative 
to @) of edges in EZ’. A directed subgraph of a directed graph is defined in essentially 
the same way if E and I are interpreted as the set of arcs and the directed incidence 
mapping, respectively. 

Symmetric graph A directed graph whose arcs can be grouped into pairs of 
parallel but oppositely directed ares. 

Terminal vertex See Directed graph. 

Total graph A directed graph such that, for every two distinct vertices v and w, 
there is a path from » to w or one from w to v (or both). 

Transitive graph A directed graph which has an are from u to w whenever 
it has an are from u to v and one from v to w. 

Tree A connected graph which has no circuits. 

Undirected graph Same as Graph. 

Unicursal graph A graph (directed graph) the totality of whose edges (arcs) 
forms a circuit (cycle) or chain (path). 

Vertex See Graph and Directed graph. 

Vertex matrix A matrix whose rows and columns correspond to the vertices of 
a graph and whose elements indicate the number of edges joining the corresponding 
pairs of vertices. 


ANSWERS TO EXERCISES 


CHAPTER 1 


Exercise 1-2 Let the corresponding vertex sequence be 1, vs, . . . , Un, where 
v: =vand v, = w. If all the v;’s are distinct, the edge progression is necessarily a 
simple chain progression. If »; = vj for some i < j, delete the edges corresponding 
to the vertex subsequence vj, viy:, . . . . vj. Repeat such deletions until all vertices 
in the sequence determined by the remaining edge progression are distinct. 


Exercise 1-3 Let G consist of three vertices »;, v2, and v; and two edges e; and es, 
where e: ~ (vi &v:) and e: ~ (v: & v2). The edge progression e:, e1, es, joining 1 
and ps, is the desired example. 


Exercise 1-4 Three or more edges, all of which have one common end point 
and whose other end points are all distinct, satisfy the stated characterization but 
fail to have the quality of “traversability,” which is vital for the concept of an edge 
progression. 

Exercise 1-5 One can obtain an open edge progression from v; to »; by simply 


adjoining the chain progression from », to v; to the chain progression from », to w». 
If this progression is not a simple chain progression, use Exercise 1-2. 


Exercise 1-6 The example given for Exercise 1-3 will suffice. 


Exercise 1-7 It is sufficient to show that the edges of a simple chain C joining 
vand w can be traversed in just two ways (one from » to wand one from w tov). Note 
first of all that precisely one edge of C is incident with v or w and precisely two edges 
of C are incident with any other vertex which meets C. Otherwise, the sequence of 
vertices induced by traversing C would include a repeated vertex. Consequently, 
if we start at v and wish to traverse each edge precisely once, we have at each stage a 
unique choice for the next edge to be traversed. 


Exercise 1-8 If a loop is added at one or more of the vertices along a simple 
chain, the resulting nonsimple chain can still be traversed in just two ways. 


Exercise 1-9 Note first that every vertex which is incident with the circuit 
has even degree relative to the circuit. Remove loops at the outset. Starting at 
any vertex v, select an edge incident with v and traverse it. From the vertex reached 
in this way, select and traverse a new edge (which must necessarily exist). Repeat 
this process until a vertex previously reached is encountered. Remove the simple 
circuit obtained in this way. Vertices incident with the remaining edges (if any) 
have even degree relative to these, and the process can be repeated. 


Exercise 1-10 Modify the construction indicated for Exercise 1-9, or else join v 
and w by means of a new edge, apply Exercise 1-9, and then remove this edge. 
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Exercise 1-11 Consider the closed progression e, ¢ and the open progression 
e, €, e involving a single repeated edge. 


Exercise 1-12 Arrange the vertices of odd degree in pairs. Join each pair 
by an edge. Then add enough loops at each vertex to attain the desired degree. 


Exercise 1-13 If the edge is not contained in any circuit, then its end points 
are not connected by any chain when the edge is removed. If it is contained in a 
circuit, then the circuit, less this edge, is a chain C joining the edge’s end points. Thus 
any edge progression that involves this edge can be replaced by one that does not by 
substituting C; and so any pair of vertices which were connected before remain 
connected. 


Exercise 1-14 A simple chain (and its vertices) or a Y-shaped graph has this 
property. 


Exercise 1-15 Delete all edges; then replace them, one at a time, in such a way 
that at each stage (after the first) the edge being added is adjacent to at least one 
previously added edge. After the first edge, which “covers” two vertices (unless it 
is a loop), each succeeding edge covers at most one new vertex. 


Exercise 1-16 If some vertex has degree less than 2, it is not incident with any 
simple circuit; while if some vertex has degree greater than 2, no simple circuit can 
account for all edges incident with it. On the other hand, if every vertex is of degree 
2, we can start at any vertex, traverse one of the two edges incident with it, then 
traverse the unique edge adjacent to the first, and continue in this way. Ultimately, 
all edges will be traversed precisely once, and the starting vertex will be reached once 
again, The characterization for chains differs in that precisely two vertices have 
degree 1. 


Exercise 1-17 Start with any edge progression. If not all edges are included, 
use connectivity to show that a new edge can be included by adjoining an appropriate 
edge progression to the end of the present one. By continuing this process, all edges 
can be ultimately included. 


Exercise 1-18 The two edges leading to the left from W and the three leading 
to the right are the desired cut-sets. 


Exercise 1-19 It is a general property of improper cut-sets. Consider the 
cut-set determined by some vertex set W and its complement W’. If, after removal 
of the cut-set, the vertices of W’ determine two or more connected components with 
vertex sets Wj,..., W, then these vertex sets and their respective comple- 
ments determine disjoint proper cut-sets whose union is the original cut-set. 


Exercise 1-20 No, 


Exercise 1-21 A single edge in a connected graph is a cut-set if and only if that 
edge is not included in any circuit. 


Exercise 1-22 Base a proof on the answer to Exercise 1-19. 
Exercise 1-23. q — 1. 


Exercise 1-24 The complete graph having two vertices is bipartite. In general: 
the complete graph having q vertices is g-partite but not p-partite for any smaller p. 
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Exercise 1-25 Consider the 3-regular graph corresponding to the vertices and 
edges of a cube. It is possible to change one end point of two appropriate edges, 
leaving others fixed, in such a way that the resulting graph is still 3-regular but includes 
triangles (i.e., 3-edged circuits). 


Exercise 1-26 Since a bipartite graph need not be connected, whereas the 
existence of cut-sets requires connectivity by definition, the answer is no. However, 
it is true that a connected graph is bipartite if and only if the totality of its edges is a 
cut-set. 


CHAPTER 2 


Exercise 2-1 If the arcs could be traversed in two different ways, some vertex 
would necessarily have two outgoing arcs. But then, in order for the traversal 
to include both arcs, such a vertex would occur twice in the corresponding vertex 
sequence. Consequently, the path is nonsimple. If loops are adjoined to some 
vertices along a simple path, the resulting nonsimple path may also be traversed in 
only one way. 


Exercises 2-2 to 2-4 Analogous to the corresponding proofs for the undirected 
case. 


Exercise 2-5 No. (Join two disjoint cycles by means of a path, having at least 
two ares, whose interior vertices are not incident with either cycle.) 


Exercise 2-6 To show that the converse is not necessarily true, consider a 
graph having vertices v, w, x, and y and ares (v,x), (z,y), (y,w), (w,x), and (y,v). To 
prove the theorem, adjoin one path progression to the other, forming a closed are 
progression. Show that some subset of consecutive arcs that is a cycle progression 
exists. 


Exercise 2-7 Start at any vertex 1. Since v; has at least one outgoing arc, 
traverse one such arc, arriving at vertex ve. Apply the same reasoning at v2, traversing 
some are to arrive at v;. Continue until the first time a vertex is repeated. The 
ares traversed between the first and second occurrence of this vertex then form a cycle. 


Exercise 2-8 Label every vertex with either A or B, so that no two A or B 
vertices are adjacent. The desired path must alternate between A and B vertices, 
but these do not exist in equal numbers. 


Exercise 2-9 Similar to Exercise 2-8. Since the rook reaches white and black 
squares alternately, it cannot start on white and also end on white after 63 moves. 


Exercise 2-10 The existence of such a progression clearly implies strong con- 
nectedness. To prove the converse, consider a strongly connected graph and a closed 
edge progression P which starts and ends at vertex v. If some are (z,y) is not included 
in P, adjoin to P an are progression from pv to z, then are (z,y), and then an are 
progression from y to v. If the resulting closed progression fails to include all ares, 
repeat the process. 


Exercise 2-11 If, for some W, there is no are from W to V — W, then there 
can be no path from any vertex in W to any in V — W, so Dis not strongly connected. 
Tf D is not strongly connected, choose v and w in such a way that there exists no path 
from v to w. Choose as W the set of vertices consisting of » and all vertices reachable 
from v by paths. Then there exist no arcs from W to V — W. 
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Exercise 2-12 Two vertices in the same cycle are clearly joined by a path. 
For two vertices in different cycles, construct an are progression from one to the 
other by taking appropriate portions of consecutive cycles; and use the fact that an 
are progression contains a path progression as a subprogression. 


Exercise 2-13 Note that there is at least one cycle of length 3 containing v and 
that every vertex is joined with v by a cycle of length 2. These two facts in conjunc- 
tion can be made the basis of a proof. 


Exercise 2-14 (n + 1)/2*. 


Exercise 2-15 Show that there is at most one vertex which is not the terminal 
vertex of any arc. Then show that there exists a path from this vertex to each other 
vertex. Finally, show that a subset of the arcs in these paths (one to each other 
vertex) which is a rooted tree exists. 


Exercise 2-16 Show that it is possible to arrange directions in such a way that 
there are two vertices neither of which is the terminal vertex of any are. Then show 
that there cannot be a rooted tree spanning the graph under these circumstances. 


Exercise 2-17 Given any cycle C and any vertex v incident with C, repeated 
application of transistivity reveals that there is a loop at v. 


Exercise 2-18 All such graphs have isolated vertices. Moreover, every com- 
ponent which is not an isolated vertex has a loop at each vertex and at least one are 
from each vertex to every other vertex of the component. 


Exercise 2-19 In (a), for example, the ares are (0,0), (1,1), (4,2), and (9,3). 


Exercise 2-20 Corresponding to every path of two ares, add a new are joining 
its end points directly. The set of such arcs represents “is a grandparent of.’ For 
every path consisting of a “parent” are followed by a “grandparent” are, add a new 
direct are. These arcs represent the desired relationship. 


CHAPTER 3 


Exercise 3.1 In Figs. 1-2, 1-3, and 1-6 the entire edge set is a chain or circuit. 
In Fig. 1-5 any one of the edges e2, es, or es, together with the complementary set, 
yields a minimal covering of “size’’ 2. 


Exercise 3-2 Let » and w be some (but not all) vertices having odd degree 
and assume that some minimal covering includes a chain joining vand w. Show that 
connectivity implies the possibility of producing a modified minimal covering which 
includes a chain joining v with some vertex other than w. 


Exercise 3-3 A circuit and chain illustrate the second and third cases, respec- 
tively. For the first ease consider a graph with vertices v, w, 2, and y, edges (v & w) 
and (z & y), and two parallel edges (w & x). Removal of one of the parallel edges 
reduces the size of a minimal cover from 2 to 1. 


Exercise 3-4 Given a partitioning of the vertices into W and W’, in traversing 
an Euler circuit it is clear that each crossing from W to W’ is matched by a crossing 
from W’ to W and that each edge of the cut-set is traversed precisely once. 


Exercise 3-5 One possible cyclic sequence is aaaabbbbaababbab. 
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Exercise 3-6 One possible cyclic sequence is aadabbddbccdcach. 


Exercise 3-7 It consists of a single vertex corresponding to a (q — 1)-letter 
word, all of whose letters are a, and a loop at this vertex. 


Exercise 3-8 Given any (g — 1)-letter word, p different g-letter words are 
formable by prefixing any of the p letters of the alphabet. Since each of the latter 
words occurs once, the original word occurs p times. Repetition of this argument 
establishes the general result. 


Exercise 3-9 In Figs, 2-2, 2-4 (left), 2-4 (right), and 2-7 the number of paths 
and/or cycles in a minimal covering is 1, 2, 2, and 13, respectively. Obtain specific 
coverings by inspection. 


Exercise 3-10 Simply traverse an Euler circuit (or chain) and assign to each 
edge the direction induced by the traversal, 


Exercise 3-11 Yes. 


Exercise 3-12 Parallel the proof of the corresponding undirected result (Theo- 
rem 3-1). = 


Exercise 3-14 If there exists a hamiltonian circuit, it necessarily includes the 
10 edges incident with the 5 vertices of degree 2. But these edges form a circuit, 
and they cannot be a subset of a larger simple circuit. 


Exercise 3-15 See answer for Exercise 3-16. 


Exercise 3-16 Since the edges of a chain or circuit in a bipartite graph alternate 
between two vertex sets, the existence of a hamiltonian circuit implies that these 
vertex sets have the same size. Existence of a hamiltonian chain implies that they 
have the same size or differ in size by 1. 


Exercise 3-18 Visit the vertices in the order AF LKJEDCIHBGA. 


Exercise 3-19 Complete graphs necessarily have this property. So do graphs 
corresponding to simple circuits and chains having an even number of vertices. 


Exercise 3-21 For the graph corresponding to a simple circuit (other than a 
loop) having an odd number of vertices, every dominating set necessarily contains a 
pair of adjacent vertices. 


Exercise 3-22 The radius is n, the diameter is 2m, and one vertex is a center. 


Exercise 3-23 For a chain having 2n + 1 edges, the quantities are n + 1, 
2n + 1, and 2, respectively. For a circuit having 2n edges they are n, n, and 2n, 
respectively. 


Exercise 3-24 A simple graph is complete if and only if its diameter is 1. 
Exercise 3-25 For examples, see answers to Exercises 3-22 and 3-23. 


Exercise 3-26 Let the number of edges in a longest chain be n. The end points 
of such a chain are extreme vertices of the graph (i.e., vertices of degree 1), and T’ = n. 
Show that the distance from a vertex which is adjacent to one of these extreme points 
to any vertex of the graph is at most n — 1. Hence, R < T. 


Exercise 3-29 The distances range from 1 to 5, two vertices having distance 4 
from »;. One vertex is inaccessible from 21. 
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Exercise 3-30 The distances from »; vary from —3 to 4. 


Exercise 3-31 Fourth-stage distances (top to bottom) are 9, 8, and 11. Fifth- 
stage distances are 12 (top) and 10. The shortest distance to v is 10. 


CHAPTER 4 


Exercise 4-1 Place the south pole of a sphere inside the square and map the 
graph onto the sphere. Then rotate the sphere so that the image of the square con- 
tains the north pole and map back onto the plane. 


Exercise 4-2 The existence of such a subgraph means that one can take one 
of the two Kuratowski subgraphs, subdivide some of its edges by introducing vertices 
of degree 2, and arrive at a graph isomorphic to this subgraph. Alternatively, we 
could start with the given subgraph and eliminate its vertices of degree 2 by regarding 
chains of edges whose end points are of degree greater than 2 and whose interior points 
are of degree 2 as single edges. 


Exercise 4-3 In Fig. 4-7, if the dashed graph is regarded as the original graph, 
the solid graph may be regarded as its dual, since it can be constructed by means of 
the construction described in the text. 


Exercise 4-4 The dual graph would be the complete simple graph having five 
vertices, which is nonplanar. 


Exercise 4-5 (a) Observe that any edge of the complete graph is adjacent to 
n — 2 other edges at each end. (b) Let the two nonadjacent vertices correspond to 
edges (v & w) and (x & y). Then the four vertices corresponding to (v & x), (v & y), 
(w &z), (w& y), and only these, are adjacent to both vertices. (c) Let the two 
adjacent vertices correspond to edges (v & w) and (w& zx). Then each is adjacent 
to the n — 2 other vertices corresponding to edges of the form (w & y), where y # v 
or 2. 


Exercise 4-6 According to part (a) of Exercise 4-5, each of these two vertices 
is adjacent to 2(n — 2) vertices. According to part b, four vertices occur in both sets 
of 2(n — 2) vertices. Thus there are 4(n — 2) — 4 distinct vertices adjacent to one 
or both of the original two vertices. 


Exercise 4-7 The three families are Dick, Jane, and Emily; Tom, Mary, and 
Michael; and Harry, Susan, and Alan. 


Exercise 4-9 Note that every vertex of the resulting graph has even degree, 
and apply Theorem 4-23, 


Exercise 4-10 The result clearly holds for six or fewer regions. Assume it holds 
for n. Consider a map with n + 1 regions and shrink some region which has five or 
fewer sides to a point. Color the resulting map with six colors. When the shrunken 
region is restored, there is at least one color not assigned to any of its neighbors. 


Exercise 4-11 Show that the number of vertices and edges satisfy 2m > 3n 
and substitute in Euler's formula. 


Exercise 4-12 Use Euler’s formu'a to show that if every region is bounded by 
at least 5 sides, then there are at least 30 edges. However, use Exercise 4-11 to show 
that if there are fewer than 12 regions, then there are fewer than 30 edges. 
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Exercise 4-13 Use an inductive proof. Assume that the theorem holds for 
nregions. Every map with n + 1 regions has one region having at most four neigh- 
bors. If such a region has three or fewer neighbors, shrink and reexpand as in Exer- 
cise 4-10. If it has four, there are two neighboring regions which are not neighbors 
to each other. Form a single region from the original region and these two neighbors. 
Color the resulting map, and show the possibility of coloring the original map when 
the boundaries are reinstated. 


Exercise 4-14 Each triangular face can be assigned one of two numbers (or 
orientations) in such a way that for any vertex, the number of faces of one type meeting 
that vertex is congruent, mod 3, to the number of the other type. 


Exercise 4-17 The following figure shows a map of six regions on a Mébius 
strip (whose left and right sides are so joined that the arrows coincide) such that 
each region is adjacent to all five others. 


CHAPTER 5 


Exercise 5-1 The possibility of permuting rows and /or columns of one incidence 
matrix in such a way that the matrix is identical with a second incidence matrix is 
clearly equivalent to establishing 1-1 correspondences between vertices (rows) and 
also between edges (columns) in such a way that incidences are preserved. 


Exercise 5-2 The column corresponding to are (vs,v5), for example, has a 1 in 
the row corresponding to vs, a —1 in the row corresponding to vs, and 0’s elsewhere. 
Note that if the vertices and arcs associated with one component are all listed before 
(ie., above and to the left of, respectively) the rest, the matrix is block-diagonal. 


Exercise 5-3 Taking the v; row of A and the C2 row of C (i.e., the C2 column 
of C’), for example, which are 


@ 1101000 0 0) 
and 
@ 10041100 0 0) 


respectively, the sum of term-by-term products is 2, which is congruent to 0 mod 2. 
The same holds for every row of A and of C. 


Exercise 5-4 If the set of edges incident with some vertex v of a connected graph 
is not a proper cut-set, then the removal of these edges divides the graph into at least 
three components. But then » is an articulation point. 
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Exercise 5-5 Since the circuit matrix and the transposed cut-set matrix are 
orthogonal, k; + ke — q < 0 (where /, and fz are their respective ranks). Moreover, 


k=m—-n+l 
andqg =m. 


Exercise 5-6 Letting the rows of A correspond (top to bottom) to v: to v4 as in 
Fig. 5-3, we have 


00 1000 
a fe = Hal, 200 
4u=|5 9 4u= 1G to 4 
1 0, 001 0 
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Exercise 5-9 For the complete simple graph having five vertices, numbered 1 
to 5, arrange the edges in the order 


23 24 25 34 35 45 12 13 14 15 
and consider the tree to be that determined by the last four edges. Then 
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= a 
Exercise 5-11 If V* > 0 for some k, then (V*)" > 0 whenever tn > k. 


Exercise 5-12 Since the graph is primitive, there is an h such that, for all powers 
of the incidence matrix > h, every entry in the row corresponding to »; is greater 
than zero. 


Exercise 5-13 Since the graph is strongly connected, there is an are from 1% 
tov; for some k. So an are progression of length h; + 1 from »; to v; can be obtained 
by taking an are progression of length A; from »; to vz and adjoining the arc from 
v, to v;. Similar reasoning can be applied to hy + 2, hi + 3, ete. 


Exercise 5-14 This follows from Exercises 5-12 and 5-13 and the definition 
of index of primitivity. 


Exercise 5-15 Every open are progression contains a simple path joining the 
same vertices. Such a path has length at most n — 1. If the length less than n — 1, 
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adjoin to such a progression a loop taken a sufficient number of times to yield length 
n-1. 


Exercise 5-16 Consider the left component in Fig. 5-3 with ares indexed as in 
Fig. 5-5, except that e: is regarded as e:. Then, if the end points of the ith are are 
denoted as p21, pai, we have 
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CHAPTER 6 


Exercise 6-1 If D(A) contains any cycles, shrink some cycle by replacing all 
its vertices by one pseudovertex, and delete all loops created in this way, Continue 
to shrink cycles until none exist. (The final graph will be a single vertex if the original 
graph was strongly connected.) Establish the fact that every end vertex (i.e., a vertex 
which is not the initial vertex of any arc) of the final graph corresponds to a set of 
vertices of the original graph which determine a closed strongly connected subgraph. 
Moreover, since the final graph has no cycles, every vertex of the final graph which 
is not an end vertex is joined by a path to an end vertex. 


Exercise 6-2 Given that z(J — B) = w has an admissible solution, form 


v-(0%) 


where p is a column vector of zeros. Let 2’ be a variable corresponding to the one 
new activity. Then 

(zx) — B*) = (I — B) — z'w,0) 
If x is an admissible solution of the open model, then (z,1) is a solution of this enlarged 
closed model. 


Exercise 6-3 Let the vertices associated with A be 1, . . . , vn and let w be 
the vertex associated with the last row and column. There is an arc (v;,w) and an are 
(w,x:) for every i, since z and y are assumed positive. Hence there exist cycles of 
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length 2. If there is at least one are of the form (»;,v;), then there also exist cycles of 
length 3. But then the GCD of lengths of cycles is 1, which implies primitivity. 
To see that the index of primitivity is 4 for some matrices A, suppose that there is 
only one arc of the form (v;,v;). Then there is no are progression of length 3 from 
v; to vu. 


Exercise 6-4 Show that if some activity (v;,v;) fails to be completed before some 
adjacent activity (v;,v.) is begun, then these activities could not have been started 
at T(v;) and T(v;), respectively. 


Exercise 6-5 Note that 7T(v;) + X(v;) is the length of some path from uy to un. 
Thus this sum does not exceed T(v,). If equality holds, this path is necessarily a 
critical path. 

n(n — 1)? 


2 and hence the 


Exercise 6-7 For the odd case, we have dre = 


maximum number is (3) +4 (2) — #n(n — 1% 
Exercise 6-8 Use the same technique as in Fig. 6-15. 


Exercise 6-9 Use the same technique as in Fig. 6-15. 


Exercise 6-11 The graph has the structure shown in the accompanying figure. 
Note that there are 60 intersections. This agrees with the value of Mo. 
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Exercise 6-12 Assume that the cubes are stacked in such a way that, from top 
to bottom, they are cubes 1, 2, 3, and 4. It is possible to arrange that sides z, y, 2’, 
and y’ (top to bottom) are colored GYBR, BGRY, BRYG, and YBRG, respectively. 


Exercise 6-13 Yes, fewer crossings are required. If the distinction between 
Cand K is removed, by replacing K by C, then in the solution the subsequence MMCK, 
(MC), MMCC, (CC) can be replaced by MMCC, (CC), eliminating one round trip. 


Exercise 6-14 Number the vertices 1 to 8 clockwise, starting at v. The desired 
path goes through the vertices in the order 143821856. Note that 1 and 8 are repeated. 


Exercise 6-15 This is essentially the vertex or adjacency matrix of the graph 
shown in Fig. 6-20. 


Exercise 6-16 This follows from the fact that n elements (the diagonal elements) 
have fixed values, while the remaining n* — n can assume either of two values. 


Exercise 6-17 Use as V the directed vertex matrix having 1’s in the following 
positions: (1,2), (2,1), (2,3), (2,4) (3,4), (4,2), (4,5), (5,2), (5,7), (6,5), (7,5). Raise 
V to appropriate powers to obtain the answers. 


Exercise 6-18 If an inner vertex were covered by more than one edge of the 
selected set, its degree would exceed 2. If an outer vertex were covered by more than 
one edge, the graph could not be a tree. 


Exercise 6-19 Note that if an inner vertex were exposed, then two outer vertices 
would also be exposed, since every edge joins an inner vertex with an outer vertex. 


Exercise 6-20 Consider a chain progression passing through vertices vo, v, 

. , Ms Where vo is the exposed outer vertex. The edge ‘oining vp with » is not in 

M, since vp is exposed. Since » is of degree 2 and is covered, the edge joining »; with 
va must be in M. Repetition of this reasoning establishes the alternating property. 


Exercise 6-21 In the case of Fig. 6-17 (including the dashed part), matchings 
which leave only one vertex exposed can be found. For Fig. 6-15 perfect matchings 
exist. In both cases the maximal matching is not unique. 


Exercise 6-22 In general, these questions may be answered by summing (a) 
rows or (b) columns, computing V? (c), and computing V + V? + V* + V* (d). 
Explain why. 


Exercise 6-25 Let ¢; denote the ith term. Whenever ¢; — tis: > 1, insert 
t; — ti41 — 1 ones after ¢;. Whenever ¢; < ti4:, insert t; — 1 ones after t;. (Do the 
latter also when ; is the last term.) 


Exercise 6-26 t) = 4. 


Exercise 6-27 Let v have a loop. If the graph has n vertices, then there exist 
paths from any v; to v, and from » to any v;, having at most n — lares. Thus one can 
obtain an are progression of length 2n — 2 by taking two such paths together with an 
appropriate number of loops. (Compare with the earlier discussion of primitive 
matrices.) 


Exercise 6-28 There are four automorphisms. The top and bottom vertices 
may be either permuted or left fixed. Independently of this, the left and right vertices 
may be either permuted or left fixed. 
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Exercise 6-29 The graph corresponding to cubing the residues has a loop at 
each vertex. The graph corresponding to squaring the residues has the following 
ares: (0,0), (1,1), (2,4), (8,3), (4,4), (6,1). 


7/0123 4656 
2154063 * 1 


Exercise 6-31 The arcs of the graph form a simple cycle which passes succes- 
sively through 0, 5, 4, 3, 2, and 1. 


Exercise 6-30 


* Undefined. 


Exercise 6-32 With respect to the first partitioning, rearrange the rows and 
columns into the order 1, 3, 2, 4, by taking 


P=P= 


Soccer 
cross 
or) 
eooo 


Compute B = P-!AP and note that its submatrices are as given. Note also that B* 
is block-diagonal, each diagonal block being 


( 2 0 
0 12 
Find an appropriate P for the other partitioning. 


Exercise 6-33 41 comparisons are required. (15) and N(15) are 41 and 45, 
respectively. 


CHAPTER 7 


Exercise 7-1 Let ¢: be the flow given in Fig. 7-2 and let g» consist of unit flow 
in each are of the same network. Then the net flow outputs at » to vs, relative to 
v1 + ¢, are 8, 5, 3, —3, —13, and 0, respectively. Relative to ¢1 — ¢s, they are 2, 
7, —3, 3, —7, and —2, respectively. Compare with net outputs relative to gy: and gs 
individually. 


Exercise 7-2 The flows are not conformal. In two ares, namely, (v3,v) and 
(v1,v4), the are flows are both nonzero and oppositely oriented. 


Exercise 7-3 The numerical results will depend entirely on the choice of flows 
in Exercise 7-2. and the choice of y. For each are, verify that (a) + ¢:(a) lies between 
¥(a) and (a) + ¢i(a) + ox(a). 


Exercise 7-4 In the unitary graph, there are, for example, three ares from v; to 
v3, ten arcs from vs to v2, two ares from vs to vs, no arcs between vs and 04, ete. 


Exercise 7-5 The following is one decomposition, given in terms of the sequence 
of vertices through which the oriented chains and circuits pass. (Asterisks indicate 
ares traversed oppositely to their inherent directions.) 


Answers to exercises 283 


6 chains wwavswe 

2 chains wwiwapsw2 

2 chains w,v,v30v,w, 
1 chain winwwewe 

1 circuit v,v7wir; 

1 circuit v,v,7¢0; 


Exercise 7-6 There are eight relevant partitionings. When W = {v:,v2,v5}, for 
example, the right-hand expression in Theorem 7-3 has the value (4 + 3 + 2) — (1 + 6). 


Exercise 7-9 The are (v:,v2), for example, having lower and upper flow bounds 
1 and 3, respectively, is replaced by three arcs: (vp,v2), (v1,v6), and (v1,v2), having 
capacities 1, 1, and 2, respectively. Their lower flow bounds are all zero (as are those 
of all ares in the auxiliary network). 


Exercise 7-11 The maximal-flow value through N’ is only 7, whereas a saturat- 
ing flow requires 19 (the sum of lower flow bounds of all ares in the original network). 


Exercise 7-12 Show that if such a cycle exists, then ¢ plus a unit flow in this 
eycle is feasible and has the same total cost as ¢. On the other hand, show that if a 
second minimal-cost flow ¢’ exists, then each unit circuit flow in a decomposition of 
¢’ — ¢ determines a cycle of this variety in the incremental graph. 


Exercise 7-13 Note, for example, that the outermost clockwise cycle in the 
incremental graph has total length 4 + 1 —2—3—1= —1. Thereisalsoa second 
clockwise cycle, including all vertices except the source, with length —2. 


Exercise 7-14 Send two units of flow along the chain v, z, w, at unit cost of 5. 
Then send one unit along the chain », x, 2, w, at unit cost 7. Finally, send two units 
along the chain », y, z, w, at unit cost 9. Total cost: 35. 


Exercise 7-15 The left-to-right flows in the following figure indicate one solution. 
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Branched-chain hydrocarbons, 200 
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Bridg-it, 168 

Bundle, 118 

Butane, 200 


Capacitated network, 244, 250, 267 
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Carbon atoms, 199 
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Caseade arc, 187 
Cascade network, 187 
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Cayley color graphs, 224 
Center of graph, 56 
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Centered tree, 200 
Centrality index, 185 
Chain, 11, 12, 68, 210, 267 
hamiltonian, 42 
simple, 12, 20, 269 
Chain flow, 241, 261 
Chain group, 117 
Chain progression, 12, 267 
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Channels, 184 
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Circuit, 11, 12, 20, 86, 139, 267 
electrical, 179 
Euler, 35 
hamiltonian, 42 
longest, 42 
simple, 12, 269 
of three edges, 76 
Circuit-basis matrix, 108 
Circuit flow, 241 


Circuit matrix, 104, 267 
Circuit postulate, 180 
Circuit progression, 12, 267 
simple, 12 
Closed arc progression, 26 
Closed curve, simple, 4 
Closed edge progression, 11, 26 
Closed input-output model, 125 
Closed network, 192 
Closed polyhedron, 93 
Closed subgraph, 125 
Coalescences, 91 
Coboundary operator, 118 
Cochain, 119 
Cocycle, 118 
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Colored cubes, 153 
Coloring edges, 80 
Coloring problems, 55, 66 
Combinatorial problems, 135 
Combinatorial topology, 117 
Commodities, 261 
Communication channels, 184 
Communication networks, 184 
synthesis of, 185 
Communication patterns, 210 
Communications between individuals, 28 
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Competition, 228 
Complementary graph, 78, 79, 267 
Complete graph, 19, 80, 82, 136, 151, 267 
Complete machine, 219 
Component of graph, 14, 102, 268 
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counting series for, 145 
equivalent, 145 
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Conformal flow, 240 
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planar, 68 
Constitutive equation, 180, 183 
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Convex polyhedron, 56 

Cost, 255 

Cost function, 255 

Couplers, 183 

Coupling, 179 

Covered vertex, 170 

Covering, of a board, 169 
of a directed graph, 36 
of a graph, 34, 36 
minimal, 34, 37 

Critical activity, 131 

Critical path method, 129 

Crossing problem, 160 

Cubes, 153 

Curie point, 202 

Customers, 264 

Cut player, 167 

Cut-set, 17, 29, 185, 268 
proper, 17, 20, 29, 269 

Cut-set matrix, 105, 109, 268 

Cybernetics, 205 

Cycle, 27, 268 
hamiltonian, 47 
maximal number of, 140 
simple, 27, 270 

Cycle index, 144 

Cycle progression, 27, 268 
simple, 27 

Cyclic directed graph, 27, 268 

Cyclic ordering, 96 

Cyclic representation, 143 

Cyelically k-partite graph, 226 

Cyclomatic number, 107 


d-complete directed graph, 47 
Decomposition, 243 
Degenerate graph, 6 
Degree, of regular graph, 19 

of vertex, 9, 25, 35, 268 
Degree-constrained subgraph, 178 
Departure, 266 
Deterministic machine, 219 
Diagonal matrix, 183 
Diameter, of directed graph, 58 

of graph, 57 
Diatomic molecule, 202 
Differentiation operator, 183 
Digital computer, use, 252, 257 
Digraph, 24, 141 

(See also Directed graph) 


Index 


Dimer problem, 202 

Direct product of polyhedra, 78 
Directed edge, 24 

Directed Euler graph, 37, 268 
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Directed graph, 24, 100, 164, 209, 226, 


242, 268 

acyclic, 27, 130, 267 

antisymmetric, 139 

asymmetric, 32, 267 

covering of, 36 

cyclic, 27 

d-complete, 47 

diameter of, 58 

isomorphic, 24 

planar, 25 

pseudosymmetric, 37 

radius of, 68 

reflexive, 31 

simple, 26 

strongly connected, 28, 41 

strongly k-connected, 28 

symmetric, 31, 58 

total, 48 

transitive, 82 

unicursal, 40 
Directed hamiltonian line, 47 
Directed incidence mapping, 24 
Directed tree, 29, 59 

root of, 29, 59 
Disarmament, 212 
Disconnected graph, 12 
Disconnecting set, 17, 268 
Distance between vertices, 56 
Distribution, exponential, 265 
Dodecahedron, 43 

rhombie, 43 


Dominating set of vertices, 54, 268 


Dominos, 203 

Donut, 94 

Dual graph, 73 

Dynamic programming, 63 


Economies, 114 

and logistics applications, 124 
Edge, 6 

adjacent, 4, 267 

directed, 24 

length of, 58 

parallel, 5, 9 
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Edge progression, 11, 26 
closed, 11, 26 
length of, 71 
open, 11, 26 
Edge sequence, 11 
Eigenvalues, 226 
Elapsed time, 130 
Electrical circuits, 179 
Electron spin, 202 
Elements, 179 
End point, 6, 9, 24, 268 
Engineering applications, 179 
Enumerable, 218 
Enumeration problems, 142 
Equilibrium basins, 207 
Ergodic Markov chain, 222 
Euclidean space, 4 
Euler circuit, 35 
Euler graph, 34, 37, 268 
directed, 37 
Buler’s formula, 68, 78, 93 
Event, 129 
Evolution, 207 
Existence of solution, 162 
Exponential distribution, 265 
Exposed vertex, 170 
Exterior graph, 71 
External stability, 166 


Factors of tree, 198 
Family group, 209 
Family tree, 32 
Fan, 148 
Feasible flow, 244 
Feedback, 114, 187 
Feedback arcs, 187 
Ferromagnetic, 202 
Figures, 145 
counting series for, 145 
Finite graph, 6, 32, 268 
Five-color theorem, 84 
Flow, 237, 268 
are, 236 
bounded, 240 
in bounded network, 251 
conformal, 240 
constrained, 236 
feasible, 244, 249 
jointly conformal set, 240 
maximal, 247, 249, 250 


Flow, minimal, 245, 249 
minimal-cost, 255 
multiple commodity, 261 
saturating, 252 
simple, 241 
stochastic, 263 
value of, 238 

Flow algorithm, 250 

Flow formulation, 242 

Flow inequality, 244 

Flow operation, 239 

Flower, 174 

Forest, 15, 268 

FORTRAN, 218 


Four color conjecture, 65, 82, 87, 90 


Four-colored graph, 88 
Free graph, 136 


Game, 153, 165 

board, 169 

of Gale, 168 

neutral, 168 

of Nim, 167 
Generator, 179 
Genes, 203 
Genetics, 203 
Genus, 94 
Geometric edge, & 
Geometric graph, 4, 7, 66 
Geometric n simplex, 92 


Geometric realization, 7, 8, 25, 268 


Geometric vertex, 4 

Global index, 185 

Glossary, 267 

Grammar, 214 

Graph, 6, 268 
abstract, 6 
activity, 129 
acyclic, 130 
associated incremental, 248 
associated undirected, 24, 28 
asymmetric, 32, 267 
of binary relation, 30, 269 
bipartite, 19, 86, 113, 267 
Cayley color, 224 
center of, 56 
chromatic number of, 55, 267 
complementary, 78, 267 


complete, 19, 80, 82, 136, 151, 267 


component of, 14, 102, 268 


Graph, connected (see Connected graphs) 

covering of, 34, 36 

eyclic, 27, 268 

cyclically k-partite, 226 

degenerate, 6 

degree of regular, 19 

diameter of, 57 

directed (see Directed graph) 

disconnected, 12 

Euler, 34, 37, 268 

exterior, 71 

finite, 6, 32, 268 

free, 136 

geometric, 4, 7, 268 

hypohamiltonian, 43 

inarticulate, 88, 269 

infinite, 6 

interior, 71 

intersect, 205 

interval, 203 

Ising, 202 

isomorphic, 7, 24, 142, 223, 269 
to within vertices of degree two, 70 

k-connected, 19, 28 

k-partite, 19, 55, 226 

k-regular, 19 

kinematic, 206 

Kuratowski, 66, 148 

nonplanar, 7, 65 

overlap, 205 

partite, 226 

planar (see Planar graphs) 

primitive, 111, 269 

project, 6, 129 

radius, of, 66 

reflexive, 31, 269 

regular, 19, 269 

rooted, 141 

separable, 67 

signal-flow, 186 

simple, 18, 26, 270 

strongly connected, 28, 41, 270 

symmetric, 31, 58, 270 

total, 48, 270 

transitive, 32, 270 

triangular, 89 

triangulated, 74 

two-connected, 19, 88 

unicursal, 34, 40, 179, 270 

unitary, 242 

(See also Directed graph) 
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Graph identification, 196 
Groups, 223 


Hamiltonian chain, 42, 269 
Hamiltonian circuit, 42, 269 
Hamiltonian cycle, 47, 269 
Hamiltonian line, 42 
directed, 47 
Hamiltonian path, 47, 231, 269 
Heawood’s formula, 94 
Heuristic algorithm, 196 
Hexane, 200 
Homeomorphism, 92, 208 
Human sciences, 205 
Hungarian tree, 174 
Husimi tree, 137 
Hydrocarbons, 199 
Hydrogen atoms, 199 
Hypohamiltonian graph, 43 


Identity permutation, 142 

Improvement chord, 61 

Inarticulate graph, 88, 269 

Incidence, 6, 269 

Incidence mapping, 6, 268 
directed, 24 
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Incidence matrix, 101, 115, 138, 182, 269 


rank of, 103 
relationship to trees, 107 
Independent set of vertices, 53, 269 
Index of primitivity, 111 
Industry, 124 
Infinite graph, 6 
Information, perfect, 166 
Initial vertex, 24 
Inner vertex, 172 
Input-output model, 114 
Inputs, 219 
Integration operator, 183 
Interconnection problem, 194 
Interindustry model, 127 
Interior graph, 71 
Intermediate vertices, 238 
Internal stability, 166 
Intersect graph, 205 
Intersections, 148 
Interval graph, 203 
Irreducible maps, 91 
Irreducible matrix, 111 
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Ising problem, 202 

Isolated vertex, 5 

Isomerism, 199 

Isomorphie graphs, 7, 24, 142, 223, 269 
directed, 24, 142 

Isomorphie vector spaces, 100 

Isomorphism, to straight graph, 74 
to within vertices of degree two, 70 


Jobs vs. machines, 171 
Jointly conformal flows, 240 
Jordan curve theorem, 69 
Junction points, 179 


k-color theorem, 86 
k-connected graph, 19, 28 
k-partite graph, 19, 55, 226 
k-regular graph, 19 

Kernel, 166 

Kinematic graph, 206 
Kirchhoff’s law, 181 
Kénigsberg bridge problem, 36 
Kuratowski graph, 66, 148 
Kuratowski subgraph, 69 
Kuratowski’s theorem, 65, 70 


Labelled graph, 136 
Labelled vertices, 141 
Language, 217 
Lattice, doubly periodic, 202 
n-dimensional, 202 
3-dimensional, 202 
Length, of an are, 58 
of an are progression, 26, 111 
of an edge, 58 
of an edge progression, 11 
Lexicographical ordering, 198 
Line, hamiltonian, 42 
Line graph, 74 
Linear independence, 116 
Linear model, 151 
Linear ordering, 196, 228 
Linear programming, 128, 260 
Linear systems, 189 
Linguistics, 214 
Linkage, 203 
Longest circuit, 42 
Loop, 9, 269 
effect in network, 190 


Machine, 219 

complete, 219 

deterministic, 219 

memoryless, 220 

sequential, 219 

stationary, 220 

strongly connected, 221 
Map, 6, 85, 91, 93, 119, 197 
Mapping, directed incidence, 24 

incidence, 6, 268 
Mark Twain, 206 
Markov chain, 208, 221 
Markov process, 223 
Matching, 170, 269 

maximal, 170 

perfect, 170 
Matching edges, 171 
Mathematical machines, 219 
Matrix, adjacency, 109, 226 

branch capacity, 185 

circuit, 104, 108, 269 

cut-set, 105, 109, 268 

incidence, 101, 103, 107, 115, 138, 182, 

269 

irreducible, 111 

nonnegative, 226 

primitive, 111 

stochastic, 126 

substochastic, 125 

terminal capacity, 185 

transition, 164 

vertex, 109, 160, 169, 208, 270 
Matrix solution of crossing problem, 160 
Matroid, 116 
Max-flow min-cut theorem, 250 
Maximal connected subgraph, 14, 103 
Maximal degree-constrained subgraph, 

178 

Maximal flow, 247, 249, 250 
Maximal gain, 87 
Maximal independent set of vertices, 53 
Maximal map, 91 
Maximal matching, 170 
Maximal number of comparisons, 230 
Maximal shortest-distance tree, 59 
Maximal subgraph, 125 
Maximal weight sum subgraph, 194 
Maximizing flow, 236 
Maze, 159 
Memoryless machines, 220 
Men vs. jobs, 171 


Men vs. women, 171 
Menger-Dirac theorem, 67 
Mesh system equations, 183 
Minimal brick factory catastrophes, 147 
Minimal cost flow, 255, 257 
Minimal covering, 34, 37 
Minimal flow, 245, 249 
Minimal intersections in complete graphs, 
151 
Minimal length tree, 225 
Minimization, 128 
Minor of matroid, 117 
Missionary and cannibal problem, 155 
Mixed star tree, 137 
Mobius strip, 92 
Mod 2 operations, 100 
Molecular bonds, 199 
Molecules, diatomic, 202 
triple bonded, 202 
Monic polynomial, 227 
Monochromatic triangles, 80 
Multiple commodity flow, 261 


n-dimensional manifold, 92 
Negative degree of a vertex, 25, 37, 269 
Negative incidence, 25, 269 
Net output, 237 
Network, 237, 269 
capacitated, 244, 250, 267 
cascade, 187 
closed, 192 
communication, 184 
flow in, 128, 237 
open, 192 
path inversion in, 188 
PERT, 33, 128, 269 
relation to physical systems, 187 
residual, 188, 190 
signal flow, 186 
stochastic flow in, 263 
switching, 192 
Network flows, 128, 236, 263 
Neutral game, 168 
Nim, game, 167 
Nodal system equations, 183 
Nonplanar graph, 7, 65 
Nonplanarity, of complete graph on five 
vertices, 69 
of three utilities graph, 69 
Nonprimitive graph, 113 
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Norm, 213 

Normal are, 241 
Nucleus, 166 
Number of paths, 110 


One-to-one transformation, 92 

Open are progression, 26 

Open curve, simple, 4 

Open edge progression, 11, 26 

Open network, 192 

Operand, 205 

Operator. 205 

Order of subgraph, 111 

Ordered pair, 5 

Organic chemistry, 196 
applications, 199 

Organization, 184 

Orientable surface, 92, 95 

Oriented surface, 96 

Orthogonality relation, 106 

Outer vertex, 172 

Output, 219 

Output function, 219 

Ovens, 147 

Overlap graph, 205 


Paraffin series, 199 
Parallel ares, 24, 269 
strictly, 25 
Parallel edges, 5, 9, 269 
Partite graph, 226 
Path, 27, 158, 167 
of disarmament, 214 
and elapsed time, 130 
hamiltonian, 47 
shortest, 59 
simple, 27, 270 
Path inversion, 188 
Path matrix, 115 
rank of, 116 
Path progression, 27, 269 
simple, 27 
Paving problem, 169, 203 
Pentane, 200 
Perfect matching, 170 
Permutation, 142 
degree of, 142 
Permutation matrix, 227 
Permutation type, 143 
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PERT, 128, 134 
PERT network, 33, 129, 269 
Physical science applications, 196 
Physical systems, 179, 187, 202 
Planar graphs, 7, 65, 66, 88, 148, 195, 269 
characterization of, 70, 73 
cut-sets and, 106 
Planning, 128 
Planted tree, 173 
root of, 173 
Platforms, 147 
Point of articulation, 67, 267 
Poisson process, 265 
Polya’s theorem, 142, 145 
Polyhedral graph, 77 
Polyhedron, convex, 56 
Positive degree of a vertex, 25, 37, 269 
Positive incidence, 25, 269 
Positive trees, 127 
Postman, Chinese, problem of, 179 
Postulates and graphs, 164 
Power stations, 194 
Primitive graph, 111, 269 
Primitive matrix, 111 
Principal diagonal, 103 
Printed circuits, 195 
Prisms, 78 
Probability and planning, 133 
Project evaluation, 128 
Project graph, 6, 129 
Proper cut-set, 17, 20, 29, 269 
Properly colorable, 85, 87 
Propositions and graphs, 164 
Pseudosymmetric directed graph, 37 
Puzzles and games, 153, 170 


Queues, 263 


Radius, of directed graph, 58 

of graph, 66 
Rail lines, 147 
Rank, 100, 104 

of circuit matrix, 107 

of incidence matrix, 103 

of path matrix, 116 
Ranking problems, 228 
Reach of a vertex, 114 
Realizebility of a matrix, 101 
Realizable matrix, 185 


Realization, geometric, 7, 8, 25 
Rectangular coordinate representation, 
150 

Reduction, 117 
Reference vertex, 59 
Reflexive binary relation, 14, 31 
Reflexive directed graph, 31, 269 
Regional adjacencies, 83 
Regions, coloring of, 82 
Regular ergodic chain, 223 
Regular graph, 19, 269 
Relation, binary, 14, 30 
Residual network, 188, 190 
Residues, 224 

classes of, 224 
Responses, 221 
Rhombie dodecahedron, 43 
Rigidity, 197 
Root, 173 

of a directed tree, 29, 59, 269 
Rooted graph, 141 
Rooted tree, 197, 200, 269 
Rule book, 165 


Saturating flow, 252 
Sector of economy, 124 
Self-complementary graph, 78 
Separable graph, 67 
Sequential machines, 219 
Service times, 265 
Shannon's game, 167 
Short player, 167 
Shortest-distance tree, 49 
Shortest hamiltonian circuit, 48 
Shortest path, 59 

algorithm for, 60 
Shrinking, of circuits, 172 

of feedback units, 188 
Signal flow, 186 
Signal-flow graphs, 186 
Signed tree, 137 
Simple chain, 12, 20, 269 
Simple chain progression, 12 
Simple circuit, 12, 269 
Simple circuit progression, 12 
Simple closed curve, 4 
Simple cycle, 27, 207, 270 
Simple cycle progression, 27 
Simple directed graph, 26 
Simple flows, 241 


Simple graph, 18, 26, 270 
Simple open curve, 4 
Simple path, 27, 270 
Simple path progression, 27 
Simple triangulated graph, 76 
Simplex, 77 
Simplicial complex, 92 
Simultaneous equations, 190 
Single-valued transformation, 206 
Sink, 238, 270 
Slack, 131 
Social science applications, 209 
Source, 238, 270 
Spanning tree, 15, 20, 29, 182, 225, 270 
Sphere, two color theorem on, 119 
Splitting operation, 188 
Stability, 212 
Star, 136 
Star tree, 137 
State, 155, 165, 219 

transient, 221 

trapping, 222 
Stationary machine, 220 
Stationary Markov process, 222 
Statistical mechanics, 202 
Steinhaus procedure, 228 
Stereographic projection, 66, 71 
Stochastic flow, 263 
Stochastic matrix, 126 
Straight chain hydrocarbons, 200 
Straight graph, 74 
Strictly parallel arcs, 25 
Strings in a language, 214 
Strongly connected directed graph, 28, 

41, 111, 270 

Strongly connected machine, 221 
Strongly k-connected directed graph, 28 
Subgraph, 10, 270 

maximal connected, 14 
Substochastic matrix, 125 
Supergraph, 10 
Surfaces, 92 

with holes, 93 
Switching function, 192 
Switching game, 167 
Switching network, 192 
Switching variables, 192 
Symmetric binary relation, 14, 31 
Symmetric directed graph, 31, 58, 270 
Symmetry number of graph, 224 
Synthesis of communication network, 185 
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System transition problems, 155 
Systems, mixed, 179 

physical, 179 

pure, 179 


Tape, 221 
Tasks, 211 
Terminal capacity matrix, 185 
Terminal vertex, 24, 189, 270 
Terminals, 167 
Three color theorem, 89 
Three utility-three house graph, 67 
Through variable, 179 
Torus, 202 
Total directed graph, 48, 270 
Transactions, 114 
Transform, 205 
Transformed graph, 175 
Transient state, 221 
Transition, 205 
Transition function, 219 
Transition matrix, 164 
Transition problems, 155 
Transitive binary relation, 14, 32 
Transitive directed graph, 32, 270 
Transitivity, 111 
Transportation, 194 
Transportation problem, 128, 260 
Trapping set, 222 
Traveling salesman problem, 48, 59, 194 
Tree identification, 196 
Trees, 15, 29, 59, 86, 270 
alternating, 172 
branch of, 15, 267 
chord of, 14, 267 
directed, 29, 59 
edge disjoint, 167 
factors of, 198 
hungarian, 174 
maximal shortest-distance, 69 
minimal length, 225 
mixed star, 137 
nonsingular submatrices and, 107 
number of, 137 
planted, 173 
positive, 127 
rooted, 29, 59, 197, 200, 269 
shortest-distance, 69 
signed, 137 
spanning, 15, 20, 29, 270 


294 Index 


Triangle division problem, 165 
Triangular graph, 89 
Triangulated graph, 74 
Triple bonded molecules, 202 
Trivalent graph, 74 
Trivalent map, 87 
Two-color theorem, 85 

on sphere, 119 
Two-connected graph, 19, 88 
Two-person games, 165 
Two-terminal elements, 179 


Unary operation, 217 

Undirected graph (see Graph) 

Unicursal graph, 34, 40, 179, 270 
directed, 40 

Unlabelled graph, 136 

Unordered pair, 6 

Unordered product of a set with itself, 6 

Unorientable surface, 92 

Unitary graph, 242 

Utility graph, 67, 79 


Value of flow, 238 
Value of input, 124 
Vector space, 100 


Venn diagram, 224 
Vertex, 6, 24, 270 
adjacent, 5, 9, 267 
degree of, 9, 25, 35, 268 
initial, 24 
inner, 172 
isolated, 5 
negative degree of, 25, 37, 269 
outer, 172 
positive degree of, 25, 37, 269 
reach of, 114 
reference, 59 
terminal, 24, 189, 270 
Vertex matrix, 109, 160, 169, 208, 270 
(See also Adjacency matrix) 
Vertex postulate, 180 
Vertices, independent set of, 63, 260 
maximal, 53 
Voltage generator, 180 


Waiting line, 263, 265 

Weapon system, 212 

Weight, of pair of edges, 81 
of vertex, 187 

Wheel configuration, 210 

Winning state of a game, 166 


